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NOVEL COMPOUNDS 

Field of the Invention; 

This invention relates to newly identified polynucleotides, polypeptides encoded by 
such polynucleotides, the uses of such polynucleotides and polypeptides, as well as the 
5 production of such polynucleotides and polypeptides and recombinant host cells 
transformed with the polynucleotides. This invention also relates to inhibiting the 
biosynthesis or action of such polynucleotides or polypeptides and to the use of such 
inhibitors in therapy. Preferred embodiments of the invention include Streptococcal 
polypeptides and polynucleotides, particularly those of Streptococcus pneumoniae. 
10 Background of the Invention; 

The Streptococci make up a medically important genera of microbes known to 
cause several types of disease in humans, including otitis media, pneumonia and meningitis. 
Since its isolation more than 100 years ago, Streptococcus pneumoniae has been one of the 
more intensively studied microbes. For example, much of our early understanding that 
15 DNA is, in fact, the genetic material was predicated on the work of Griffith and of Avery, 
Macleod and McCarty using this microbe. Despite the vast amount of research with S. 
pneumoniae, many questions concerning the virulence of this microbe remain. 

While certain Streptococcal factors associated with pathogenicity have been 
identified, e.g., capsule polysaccharides, peptidoglycans, pneumolysins, PspA Complement 
20 factor H binding component, autolysin, neuraminidase, peptide permeases, hydrogen 
peroxide, IgAl protease, the list is certainly not complete. Further very little is known 
concerning the temporal expression of such genes during infection and disease progression 
in a mammalian host. Discovering the sets of genes the bacterium is likely to be expressing 
at the different stages of infection, particularly when an infection is established, provides 
25 critical information for the screening and characterization of novel antibacterials which can 
interrupt pathogenesis. In addition to providing a fuller understanding of known proteins, 
such an approach will identify previously unrecognised targets. 
Brief Description of the Invention; 

This invention provides novel proteins, particularly those from Streptococcus 
30 pneumoniae, strain 0100993, characterised in that it comprises the amino acid sequences 
given herein or fragments, analogues or derivatives thereof. 

In accordance with another aspect of the present invention, there are provided 
polynucleotides (DNA or RNA) which encode such polypeptides. 



1 
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In particular the invention provides polynucleotides having the DNA sequences 
given herein. 

The invention also relates to novel oligonucleotides derived from the sequences 
given herein which can act as PCR primers in the process herein described to determine 
5 whether or not the Streptococcus pneumoniae gene identified herein in whole or in part is 
expressed in infected tissue. It is recognised that such sequences will also have utility in 
diagnosis of the stage of infection and type of infection the pathogen has attained. The 
proteins so identified are also useful as targets in screens designed to identify antimicrobial 
compounds. 

10 It is an object of the invention to provide polypeptides that have been identified as novel 

polypeptides by homology between the amino acid sequence set out in the Sequence Listing and 
a known amino acid sequence or sequences of other proteins such as the proteins identified under 
the heading Identity in Table 1 . 

It is a further object of the invention to provide polynucleotides that encode polypeptides 

15 set forth in the Sequence Listing, particularly polynucleotides that encode the polypeptide set 
forth in the Sequence Listing. 

In a particularly preferred embodiment of the invention the polynucleotide comprises a 
region encoding polypeptides comprising the sequence set out in the Sequence Listing, or a 
variant thereof. 

20 In another particularly preferred embodiment of the invention there is a novel 

protein from comprising an amino acid sequence of the Sequence Listing, or a variant 
thereof. 

In accordance with another aspect of the invention, there is provided the use of a 
polynucleotide of the invention for therapeutic or prophylactic purposes, in particular 

25 genetic immunization. Among the particularly preferred embodiments of the invention are 
naturally occurring allelic variants the polynucleotides set forth in the Sequence Listing and 
polypeptides encoded thereby. 

In accordance with yet another aspect of the invention, there are provided inhibitors 
to such polypeptides, useful as antibacterial agents, including, for example, antibodies. 

30 In accordance with certain preferred embodiments of the invention, there are provided 

products, compositions and methods for assessing expression of the sequences the Sequence 
Listing, treating disease, for example, otitis media, conjunctivitis, pneumonia, bacteremia, 
meningitis, sinusitis, pleural empyema and endocarditis, and most particularly meningitis, 
such as for example infection of cerebrospinal fluid, assaying genetic variation, and 
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administering a polypeptide or polynucleotide of the invention to an organism to raise an 
immunological response against a bacteria, especially a Streptococcus pneumoniae bacteria. 

In accordance with certain preferred embodiments of this and other aspects of the 
invention there are provided polynucleotides that hybridize to polynucleotide sequences of the 
5 invention, particularly under stringent conditions. 

In certain preferred embodiments of the invention there are provided antibodies against 
polypeptides of the invention. 

In other embodiments of the invention there are provided methods for identifying 
compounds which bind to or otherwise interact with and inhibit or activate an activity of a 
10 polypeptide or polynucleotide of the invention comprising: contacting a polypeptide or 

polynucleotide of the invention with a compound to be screened under conditions to permit 
binding to or other interaction between the compound and the polypeptide or polynucleotide to 
assess the binding to or other interaction with the compound, such binding or interaction being 
associated with a second component capable of providing a detectable signal in response to the 
15 binding or interaction of the polypeptide or polynucleotide with the compound; and determining 
whether the compound binds to or otherwise interacts with and activates or inhibits an activity of 
the polypetide or polynucleotide by detecting the presence or absence of a signal generated from 
the binding or interaction of the compound with the polypeptide or polynucleotide. 

In accordance with yet another aspect of the invention, there are provided agonists and 
20 antagonists of the polynucleotide and/or polypeptides of the invention, preferably bacteriostatic 
or bacteriocidal agonists and antagonists. 

In a further aspect of the invention there are provided compositions comprising a 
polynucleotide or a polypeptide of the invention for administration to a cell or to a multicellular 
organism. 

25 Various changes and modifications within the spirit and scope of the disclosed invention 

will become readily apparent to those skilled in the art from reading the following descriptions 
and from reading the other parts of the present disclosure. 
Detailed Desc ription of the Invention: 

Each of the DNA sequences provided herein may be used in the discovery and 

30 development of antibacterial compounds. Because each of the sequences contains an open 
reading frame (ORF) with appropriate initiation and termination codons, the encoded 
protein upon expression can be used as a target for the screening of antimicrobial drugs. 
Additionally, the DNA sequences encoding preferably the amino terminal regions of the 
encoded protein or the Shine-Delgamo region can be used to constmct antisense sequences 
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to control the expression of the coding sequence of interest. Furthermore, many of the 
sequences disclosed herein also provide regions upstream and downstream from the 
. encoding sequence. These sequences are useful as a source of regulatory elements for the 
control of bacterial gene expression. Such sequences are conveniently isolated by 
5 restriction enzyme action or synthesized chemically and introduced, for example, into 
promoter identification strains. These strains contain a reporter structural gene sequence 
located downstream from a restriction site such that if an active promoter is inserted, the 
reporter gene will be expressed. 

Although each of the sequences may be employed as described above, this 

10 invention also provides several means for identifying particularly useful target genes. The 
first of these approaches entails searching appropriate databases for sequence matches in 
related organisms. Thus, if a homologue exists, the Streptococcal-like form of this gene 
would likely play an analogous role. For example, a Streptococcal protein identified as 
homologous to a cell surface protein in another organism would be useful as a vaccine 

15 candidate. To the extent such homologies have been identified for the sequences disclosed 
herein they are reported along with the encoding sequence. 

Recently techniques have become available to evaluate temporal gene expression in 
bacteria, particularly as it applies to viability under laboratory and infection conditions. A 
number of methods can be used to identify genes which are essential to survival per se f or 

20 essential to the establishment/maintenance of an infection. Identification of an ORF 
unknown by one of these methods yields additional information about its function and 
permits the selection of such an ORF for further development as a screening target. Briefly, 
these approaches include: 
1) Signature Tagged Mutagenesis (STM) 

25 This technique is described by Hensel £l al„ Science 269: 400-403(1995), the 

contents of which is incorporated by reference for background purposes. 
Signature tagged mutagenesis identifies genes necessary for the establishment/maintenance 
of infection in a given infection model. 

The basis of the technique is the random mutagenesis of target organism by various 

30 means (e.g., transposons) such that unique DNA sequence tags are inserted in close 

proximity to the site of mutation. The tags from a mixed population of bacterial mutants 
and bacteria recovered from an infected hosts are detected by amplification, radiolabeling 
and hybridisation analysis. Mutants attenuated in virulence are revealed by absence of the 
tag from the pool of bacteria recovered from infected hosts. 
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In Streptococcus pneumnn)^ because the transposon system is less well 
developed, a more efficient way of creating the tagged mutants is to use the insertion- 
duplication mutagenesis technique as described by Morrison et a!.. J. Bacteriol. 159:870 
(1984) the contents of which is incorporated by reference for background purposes. 
5 2) In Vivo Expression Technology (IVET) 

This technique is described by Camilli el al„ Proc . Nat l . Acad . Sci. USA . 9 i :2634- 
2638 (1994), the contents of which is incorporated by reference for background purposes. 
IVET identifies genes up-regulated during infection when compared to laboratory 
cultivation, implying an important role in infection. ORF identified by this technique are 
10 implied to have a significant role in infection establishment/maintenance. 

In this technique random chromosomal fragments of target organism are cloned 
upstream of a promoter-less recombinase gene in a plasmid vector. This construct is 
introduced into the target organism which carries an antibiotic resistance gene flanked by 
resolvase sites. Growth in the presence of the antibiotic removes from the population those 
15 fragments cloned into the plasmid vector capable of supporting transcription of the 

recombinase gene and therefore have caused loss of antibiotic resistance. The resistant pool 
is introduced into a host and at various times after infection bacteria may be recovered and 
assessed for the presence of antibiotic resistance. The chromosomal fragment carried by 
each antibiotic sensitive bacterium should carry a promoter or portion of a gene normally 
20 upregulated during infection. Sequencing upstream of the recombinase gene allows 
identification of the up regulated gene. 
3) Differential display 

This technique is described by Chuang el al, J. Bacteriol 1 75:2026-2036 (1993), 
the contents of which is incorporated by reference for background purposes. This method 
25 identifies those genes which are expressed in an organism by identifying mRNA present 
using randomly-primed RT-PCR. By comparing pre-infection and post infection profiles, 
genes up and down regulated during infection can be identified and the RT-PCR product 
sequenced and matched to ORF 'unknowns'. 



5 
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4) Generation or conditional lethal mutants by transpos n mutagenesis. 

This technique, described by de Lorenzo, V. & al. t Gene 123:17-24 (1993); 
Neuwald, A. F. si it. Gens 125: 69-73(1993); and Takiff, H. E. et aL, J. Bacteriol . 
1 74: 1544- 1553( 1992), the contents of which is incorporated by reference for 
5 background purposes, identifies genes whose expression are essential for cell viability. 

In this technique transposons carrying controllable promoters, which provide 
transcription outward from the transposon in one or both directions, are generated. Random 
insertion of these transposons into target organisms and subsequent isolation of insertion 
mutants in the presence of inducer of promoter activity ensures that insertions which 

10 separate promoter from coding region of a gene whose expression is essential for cell 

viability will be recovered. Subsequent replica plating in the absence of inducer identifies 
such insertions, since they fail to survive. Sequencing of the flanking regions of the 
transposon allows identification of site of insertion and identification of the gene disrupted. 
Close monitoring of the changes in cellular processes/morphology during growth in the 

15 absence of inducer yields information on likely function of the gene. Such monitoring 
could include flow cytometry (cell division, lysis, redox potential, DNA replication), 
incorporation of radiochemical^ labeled precursors into DNA, RNA, protein, lipid, 
peptidoglycan, monitoring reporter enzyme gene fusions which respond to known cellular 
stresses. 

20 5) Generation of conditional lethal mutants by chemical mutagenesis. 

This technique is described by Beckwith, J.. Methods in Enzvmologv 204: 
3-18(1991), the contents of which are incorporated herein by reference for background 
purposes. In this technique random chemical mutagenesis of target organism, growth at 
temperature other than physiological temperature (permissive temperature) and subsequent 

25 replica plating and growth at different temperature (e.g. 42°C to identify ts, 25°C to identify 
cs) are used to identify those isolates which now fail to grow (conditional mutants). As 
above close monitoring of the changes upon growth at the non-permissive temperature 
yields information on the function of the mutated gene. Complementation of conditional 
lethal mutation by library from target organism and sequencing of complementing gene 

30 allows matching with unknown ORF. 
6) RT-PCR 

Streptococcus pneumoniae messenger RNA is isolated from bacterial infected tissue e.g. 48 
hour murine lung infections, and the amount of each mRNA species assessed by reverse 
transcription of the RNA sample primed with random hexanucleotides followed by PCR 

6 
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with gene specific primer pairs. The determination of the presence and amount of a 
particular mRNA species by quantification of the resultant PCR product provides 
information on the bacterial genes which are transcribed in the infected tissue. Analysis of 
gene transcription can be carried out at different times of infection to gain a detailed 
knowledge of gene regulation in bacterial pathogenesis allowing for a clearer understanding 
of which gene products represent targets for screens for novel antibacterials. Because of 
the gene specific nature of the PCR primers employed it should be understood that the 
bacterial mRNA preparation need not be free of mammalian RNA. This allows the 
investigator to carry out a simple and quick RNA preparation from infected tissue to 
obtain bacterial mRNA species which are very short lived in the bacterium (in the order of 2 
minute halflives). Optimally the bacterial mRNA is prepared from infected murine lung 
tissue by mechanical disruption in the presence of TRIzole (GIBCO-BRL) for very short 
periods of time, subsequent processing according to the manufacturers of TRIzole reagent 
and DNAase treatment to remove contaminating DNA. Preferably the process is optimised 
1 5 by finding those conditions which give a maximum amount of S.pneumoniae 1 6S 

ribosomal RNA as detected by probing Northerns with a suitably labelled sequence specific 
oligonucleotide probe. Typically a 5' dye labelled primer is used in each PCR primer pair 
in a PCR reaction which is terminated optimally between 8 and 25 cycles. The PCR 
products are separated on 6% polyacrylamide gels with detection and quantification using 
20 GeneScanner (manufactured by ABI). 

Each of these techniques may have advantages or disadvantage depending on the 
particular application. The skilled artisan would choose the approach that is the most 
relevant with the particular end use in mind. For example, some genes might be recognised 
as essential for infection but in reality are only necessary for the initiation of infection and 
so their products would represent relatively unattractive targets for antibacterials developed 
to cure established and chronic infections. 

Use of the of these technologies when applied to the ORFs of the present invention 
enables identification of bacterial proteins expressed during infection, inhibitors of which 
would have utility in ami- bacterial therapy. 

Streptococcus pneumoniae, strain 01 00993 has been deposited at the National 
Collection of Industrial and Marine Bacteria Ltd. (NCIMB). Aberdeen. Scotland under 
NCIMB number 40794 on 1 1 April 1996. and a Streptococcus pneumoniae, strain 0100993 
DNA library in E. coli was similarly deposited on 17 April 1996 under NCIMB number 
40800. 
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The nucleotide sequences disclosed herein can be obtained by synthetic chemical 
techniques known in the art or can be obtained from Streptococcus pneumoniae, strain 
0100993by probing a DNA preparation with probes constructed from the particular 
sequences disclosed herein. Alternatively, oligonucleotides derived from a disclosed 
5 sequence can act as PCR primers in a process of PCR- based cloning of the sequence from a 
bacterial genomic source. It is recognised that such sequences will also have utility in 
diagnosis of the stage of infection and type of infection the pathogen has attained. 

To obtain the polynucleotide encoding the protein using the DNA sequence given 
herein typically a library of clones of chromosomal DNA of Streptococcus pneumoniae, 

10 strain 0100993 in E. coli or some other suitable host is probed with a radiolabeled 

oligonucleotide, preferably a 17mer or longer, derived from the partial sequence. Clones 
carrying DNA identical to that of the probe can then be distinguished using high stringency 
washes. By sequencing the individual clones thus identified with sequencing primers 
designed from the original sequence it is then possible to extend the sequence in both 

15 directions to determine the full gene sequence. Conveniently such sequencing is performed 
using denatured double stranded DNA prepared from a plasmid clone. Suitable techniques 
are described by Maniatis, T., Fritsch, E.F. and Sambrook, J. in MOLECULAR CLONING, 
A Laboratory Manual, 2nd edition, 1989, Cold Spring Harbor Laboratory (see: Screening 
By Hybridization 1.90 and Sequencing Denatured Double-Stranded DNA Templates 

20 13.70). 

A polynucleotide of the present invention may be in the form of RNA or in the 
form of DNA, which DNA includes cDNA, genomic DNA, and synthetic DNA. The DNA 
may be double-stranded or single-stranded, and if single stranded may be the coding strand 
or non-coding (anti-sense) strand. The coding sequence which encodes the polypeptide 

25 may be identical to the coding sequence shown or may be a different coding sequence 

which coding sequence, as a result of the redundancy or degeneracy of the genetic code, 
encoding the same polypeptide. 

The present invention includes variants of the hereinabove described 
polynucleotides which encode fragments, analogues and derivatives of the polypeptide 

30 characterized by the deduced amino acid sequence given herein. The variant of the 

polynucleotide may be a naturally occurring allelic variant of the polynucleotide or a non- 
naturally occurring variant of the polynucleotide. In addition to the standard A, G, C, T/U 
representations for nucleic acid bases, the term "N" is also used. "N" means that any of the 
four DNA or RNA bases may appear at such a designated position in the DNA or RNA 



WO 97/37026 PCT/US97/0S306 
sequence, except that N cannot be a base that when taken in combination with adjacent 
nucleotide positions, when read in the correct reading frame, would have the effect of 
generating a premature termination codon in such reading frame. 

Thus, the present invention includes polynucleotides encoding the same 
5 polypeptide characterized by the deduced amino acid sequence given herein as well as 
variants of such polynucleotides which variants encode for a fragment, derivative or 
analogue of the polypeptide. Such nucleotide variants include deletion variants, 
substitution variants and addition or insertion variants. 

The polynucleotide may have a coding sequence which is a naturally occurring 
1 0 allelic variant of the coding sequence characterized by the DNA sequence disclosed herein. 
As known in the art, an allelic variant is an alternate form of a polynucleotide sequence 
which may have a substitution, deletion or addition of one or more nucleotides, which does 
not substantially alter the function of the encoded polypeptide. 

The polynucleotide which encodes for the mature polypeptide, may include only 
1 5 the coding sequence for the mature polypeptide or the coding sequence for the mature 
polypeptide and additional coding sequence such as a leader or secretory sequence or a 
proprotein sequence. 

Thus, the term "polynucleotide encoding a polypeptide" encompasses a 
polynucleotide which includes only coding sequence for the polypeptide as well as a 
20 polynucleotide Which includes additional coding and/or non-coding sequence. 

The present invention therefore includes polynucleotides, wherein the coding 
sequence for the mature polypeptide may be fused in the same reading frame to a 
polynucleotide sequence which aids in expression and secretion of a polypeptide from a 
host cell, for example, a leader sequence which functions as a secretory sequence for 
controlling transport of a polypeptide from the cell. The polypeptide having a leader 
sequence is a preprotein and may have the leader sequence cleaved by the host cell to form 
the mature form of the polypeptide. The polynucleotides may also encode for a proprotein 
which is the mature protein plus additional 5" amino acid residues. A mature protein having 
a prosequence is a proprotein and may be an inactive form of the protein. Once the 
30 prosequence is cleaved an active mature protein remains. 

Thus, for example, the polynucleotide of the present invention may code for a 
mature protein, or for a protein having a prosequence or for a protein having both a 
prosequence and a presequence (leader sequence). Further, the amino acid sequences 
provided herein show a methionine residue at the NH r terminus. It is appreciated, however, 
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that during post-translational modification of the peptide, this residue may be deleted. 
Accordingly* this invention contemplates the use of both the methionine-containing and the 
methionineless amino terminal variants of each protein disclosed herein. 

The polynucleotides of the present invention may also have the coding sequence 
5 fused in frame to a marker sequence at either the 5' or 3' terminus of the gene which allows 
for purification of the polypeptide of the present invention. The marker sequence may be a 
hexa-histidine tag supplied by the pQE series of vectors (supplied commercially by Quiagen 
Inc.) to provide for purification of the polypeptide fused to the marker in the case of a 
bacterial host. 

10 The present invention further relates to polynucleotides which hybridize to the 

hereinabove-described sequences if there is at least 50% and preferably at least 70% identity 
between the sequences. The present invention particularly relates to polynucleotides, 
particularly Streptococcal polynucleotides, which hybridize under stringent conditions to 
the hereinabove-described polynucleotides . As herein used, the term "stringent conditions" 

15 means hybridization will occur only if there is at least 95% and preferably at least 97% 
identity between the sequences. The polynucleotides which hybridize to the hereinabove 
described polynucleotides in a preferred embodiment encode polypeptides which retain 
substantially the same biological function or activity as the polypeptide characterised by the 
deduced amino acid sequence given herein. An example of stringent hybridization 

20 conditions is overnight incubation at 42°C in a solution comprising: 50% formamide, 5x 
SSC (150mM NaCl, 15mM trisodium citrate), 50 mM sodium phosphate (pH7.6), 5x 
Denhardt s solution, 10% dextran sulfate, and 20 micrograms/ml denatured, sheared salmon 
sperm DNA, followed by washing the hybridization support in 0.1 x SSC at about 65°C. 
Hybridization and wash conditions are well known and exemplified in Sambrook, et al. % 

25 Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y., 
(1989), particularly Chapter 1 1 therein. 

The invention also provides a polynucleotide consisting essentially of a 
polynucleotide sequence obtainable by screening an appropriate library containing the 
complete gene for a polynucleotide sequence selected from the group consisting of the 

30 polynucleotides of the Sequence Listing under stringent hybridization conditions with a 

probe having the sequence of said polynucleotide sequence set forth in said polynucleotide 
of the Sequence Listing or a fragment thereof; and isolating said DNA sequence. 
Fragments useful for obtaining such a polynucleotide include, for example, probes and 
primers described elsewhere herein. 
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"Identity," as known in the an and used herein, is a relationship between two or more 
polypeptide sequences or two or more polynucleotide sequences, as determined by comparing 
the sequences. In the art, "identity" also means the degree of sequence relatedness between 
polypeptide or polynucleotide sequences, as the case may be. as determined by the match 
between strings of such sequences. "Identity" and "similarity" can be readily calculated by 
known methods, including but not limited to those described in {Computational Molecular 
Biology, Lesk. A.M.. ed.. Oxford University Press, New York. 1988; Biocomputing: 
Informatics and Genome Projects, Smith, D.W.. ed.. Academic Press. New York, 1993; 
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., eds., 
Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G 
Academic Press, 1987; and Sequence Analysis Primer, Gribskov. M. and Devereux J eds 

MStockt 0 nPress.NewYork.l991;andCarillo.H.,andLipman,D.,SIAM J. Applied 
Matn.. 48: 1073 (1988). Preferred methods to determine identity are designed to g.ve the 
largest match between the sequences tested. Methods to determine identity and similarity 
are codified in publicly available computer programs. Preferred computer program methods 
to determine identity and similarity between two sequences include, but are not limited to. 
the GCG program package (Devereux, J., et a!.. Nucleic Acids Research 12(1): 387 (1984)) 
BLASTP, BLASTN, and FASTA (Atschul. S.F. et al., J. Molec. Biol. 215: 403-410 (1990)'. 
The BLAST X program is publicly available from NCBI and other sources {BLAST 
Manual, Altschul, S., etai, NCBI NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. 
Mol. Biol. 215: 403-410 (1 990). 

As an illustration, by a polynucleotide having a nucleotide sequence having at least 
for example, 95% "identity" to a reference nucleotide sequence selected from the group 
cons.stmg of the polynucleotide of the Sequence Listing is intended that the nucleotide 
sequence of the polynucleotide is identical to the reference sequence except that the 
polynucleotide sequence may include up to five point mutations per each 100 nucleotides of 
the reference nucleotide sequence. In other words, to obtain a polynucleotide having a 
nucleoude sequence at least 95% identical to a reference nucleotide sequence, up to 5% of 
the nucleotides in the reference sequence may be deleted or substituted with another 
nucleotide, or a number of nucleotides up to 5% of the total nucleotides in the reference 
sequence may be inserted into the reference sequence. These mutanons of the reference 
sequence may occur at the 5 or 3 terminal positions of the reference nucleotide sequence 
or anywhere between those terminal positions, interspersed either individually among 
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nucleotides in the reference sequenc or in one or more contiguous groups within the 
reference sequence. 

Similarly, by a polypeptide having an amino acid sequence having at least, 
for example, 95% identity to a reference amino acid sequence of selected from the 
5 group consisting of the amino acids of the Sequence Listing is intended that the 

amino acid sequence of the polypeptide is identical to the reference sequence except 
that the polypeptide sequence may include up to five amino acid alterations per 
each 100 amino acids of the reference amino acid. In other words, to obtain a 
polypeptide having an amino acid sequence at least 95% identical to a reference 

10 amino acid sequence, up to 5% of the amino acid residues in the reference sequence 
may be deleted or substituted with another amino acid, or a number of amino acids 
up to 5% of the total amino acid residues in the reference sequence may be inserted 
into the reference sequence. These alterations of the reference sequence may occur 
at the amino or carboxy terminal positions of the reference amino acid sequence or 

15 anywhere between those terminal positions, interspersed either individually among 
residues in the reference sequence or in one or more contiguous groups within the 
reference sequence. 

The deposit referred to herein will be maintained under the terms of the Budapest 
Treaty on the International Recognition of the Deposit of Micro-organisms for purposes of 

20 Patent Procedure. These deposits are provided merely as convenience to those of skill in 
the art and are not an admission that a deposit is required under 35 U.S.C. § 1 1 2. The 
sequence of the polynucleotides contained in the deposited material, as well as the amino 
acid sequence of the polypeptides encoded thereby, are incorporated herein by reference and 
are controlling in the event of any conflict with any description of sequences herein. A 

25 license may be required to make, use or sell the deposited material, and no such license is 
hereby granted. 

The terms "fragment," "derivative" and "analogue" when referring to the 
polypeptide characterized by the deduced amino acid sequence herein, means a polypeptide 
which retains essentially the same biological function or activity as such polypeptide. 
30 Thus, an analogue includes a proprotein which can be activated by cleavage of the 
proprotein portion to produce an active mature polypeptide. 

The polypeptide of the present invention may be a recombinant polypeptide, a 
natural polypeptide or a synthetic polypeptide, preferably a recombinant polypeptide. 
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The fragment, derivative or analogue of the polypeptide characterized by the 
deduced amino acid sequence herein may be (i) one in which one or more of the amino acid 
residues are substituted with a conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid residue may or may not be 
5 one encoded by the genetic code, or (ii) one in which one or more of the amino acid 
residues includes a substituent group, or (iii) one in which the polypeptide is fused with 
another compound, such as a compound to increase the half-life of the polypeptide (for 
example, polyethylene glycol), or(iv) one in which the additional amino acids are fused to 
the polypeptide, such as a leader or secretory sequence or a sequence which is employed for 
10 purification of the polypeptide or a proprotein sequence. Such fragments, derivatives and 
analogues are deemed to be within the scope of those skilled in the art from the teachings 



25 
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herein. 



The polypeptides and polynucleotides of the present invention are preferably 
provided in an isolated form, and preferably are purified to homogeneity. 
1 5 The term "isolated" means that the material is removed from its original 

environment (e.g., the natural environment if it is naturally occurring). For example, a 
naturally-occurring polynucleotide or polypeptide present in a living animal is not isolated, 
but the same polynucleotide or polypeptide, separated from some or all of the coexisting 
materials in the natural system, is isolated. Such polynucleotides could be part of a vector 
20 and/or such polynucleotides or polypeptides could be part of a composition, and still be 
isolated in that such vector or composition is not part of its natural environment. 

The present invention also relates to vectors which include polynucleotides of the 
present invention, host cells which are genetically engineered with vectors of the invention 
and the production of polypeptides of the invention by recombinant techniques. 

In accordance with yet a further aspect of the present invention, there is therefore 
provided a process for producing the polypeptide of the invention by recombinant 
techniques by expressing a polynucleotide encoding said polypeptide in a host and 
recovering the expressed product. Alternatively, the polypeptides of the invention can be 
synthetically produced by conventional peptide synthesizers. 

Host cells are genetically engineered (transduced or transformed or transfected) 
with the vectors of this invention which may be, for example, a cloning vector or an 
expression vector. The vector may be. for example, in the form of a plasmid, a cosmid. a 
phage, etc. The engineered host cells can be cultured in conventional nutrient media 
modified as appropriate for activating promoters, selecting transformants or amplifying the 
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genes. The culture conditions, such as temperature, pH and the like, are those previously 
used with the host cell selected for expression, and will be apparent to the ordinarily skilled 
artisan. 

Suitable expression vectors include chromosomal, nonchromosomal and synthetic 
5 DNA sequences, e.g., bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors 
derived from combinations of plasmids and phage DNA. However, any other vector may 
be used as long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a variety of 
procedures. In general, the DNA sequence is inserted into an appropriate restriction 

10 endonuclease site(s) by procedures known in the art. 

The DNA sequence in the expression vector is operatively linked to an appropriate 
expression control sequence(s) (promoter) to direct mRNA synthesis. As representative 
examples of such promoters, there may be mentioned: LTR or SV40 promoter, the E. coli. 
lac or rrp, the phage lambda Pl promoter and other promoters known to control expression 

15 of genes in eukaryotic or prokaryotic cells or their viruses. The expression vector also 

contains a ribosome binding site for translation initiation and a transcription terminator. The 
vector may also include appropriate sequences for amplifying expression. 

In addition, the expression vectors preferably contain one or more selectable marker 
genes to provide a phenotypic trait for selection of transformed host cells such as 

20 dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or such as 
tetracycline or ampicillin resistance in E. coli. 

The gene can be placed under the control of a promoter, ribosome binding site (for 
bacterial expression) and, optionally, an operator (collectively referred to herein as "control" 
elements), so that the DNA sequence encoding the desired protein is transcribed into RNA 

25 in the host cell transformed by a vector containing this expression construction. The coding 
sequence may or may not contain a signal peptide or leader sequence. The polypeptides of 
the present invention can be expressed using, for example, the E. coli tac promoter or the 
protein A gene (spa) promoter and signal sequence. Leader sequences can be removed by 
the bacterial host in post-translational processing. See, e.g., U.S. Patent Nos. 4,43 1,739; 

30 4,425,437; 4,338,397. Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. Two 
appropriate vectors are PKK232-8 and PCM7. Particular named bacterial promoters 
include lad, lacZ, T3, T7, gpt, lambda Pr, Pl and trp. Eukaryotic promoters include CMV 
immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and 
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mouse metallothionein-I. Selection of the appropriate vector and promoter is well within the 
level of ordinary skill in the art. 

In addition to control sequences, it may be desirable to add regulatory sequences 
which allow for regulation of the expression of the protein sequences relative to the growth 
5 of the host cell. Regulatory sequences are known to those of skill in the art, and examples 
include those which cause the expression of a gene to be turned on or off in response to a 
chemical or physical stimulus, including the presence of a regulatory compound. Other 
types of regulatory elements may also be present in the vector, for example, enhancer 
sequences. 

10. An expression vector is constructed so that the particular coding sequence is located 

in the vector with the appropriate regulatory sequences, the positioning and orientation of 
the coding sequence with respect to the control sequences being such that the coding 
sequence is Iranscribed under the "control" of the control sequences (i.e.. RNA polymerase 
which binds to the DNA molecule at the control sequences transcribes the coding 
1 5 sequence). Modification of the coding sequences may be desirable to achieve this end. For 
example, in some cases it may be necessary to modify the sequence so that it may be 
attached to the control sequences with the appropriate orientation; i.e.. to maintain the 
reading frame. The control sequences and other regulatory sequences may be ligated to the 
coding sequence prior to insertion into a vector, such as the cloning vectors described 
above. Alternatively, the coding sequence can be cloned directly into an expression vector 
which already contains the control sequences and an appropriate restriction site. 

Generally, recombinant expression vectors will include origins of replication and 
selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance 
gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a highly- 
expressed gene to direct transcription of a downstream structural sequence. The 
heterologous structural sequence is assembled in appropriate phase with translation 
initiation and termination sequences, and preferably, a leader sequence capable of directing 
secretion of translated protein into the periplasmic space or extracellular medium. 
Optionally, the heterologous sequence can encode a fusion protein including an N-terminal 
identification peptide imparting desired characteristics, e.g., stabilization or simplified 
purification of expressed recombinant product. 

The vector containing the appropriate DNA sequence as hereinabove described, as 
well as an appropriate promoter or control sequence, may be employed to transform an 
appropriate host to permit the host to express the protein. 

15 
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More particularly, the present invention also includes recombinant constructs 
comprising one or more of the sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which a sequence of the invention 
has been inserted, in a forward or reverse orientation. In a preferred aspect of this 

5 embodiment, the construct further comprises regulatory sequences, including, for example, 
a promoter, operably linked to the sequence. Large numbers of suitable vectors and 
promoters are known to those of skill in the art, and are commercially available. The 
following vectors are provided by way of example. Bacterial: pET-3 vectors (Stratagene), 
pQE70, pQE60, pQE-9 (Qiagen), pbs, pDIO, phagescript, psiX174, pbluescript SK, pbsks, 

0 pNH8A, pNH16a, pNH18A, pNH46A (Stratagene); ptrc99a, pKK223-3, pKK233-3, 

pDR540, pRIT5 (Pharmacia). Eukaryotic: pBlueBacin (Invitrogen), pWLNEO, pSV2CAT, 
pOG44. pXTl, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia). However, 
any other plasmid or vector may be used as long as they are replicable and viable in the 
host. 

5 Examples of recombinant DNA vectors for cloning and host cells which they can 

transform include the bacteriophage I (£. co/i), pBR322 (E. coli), pACYC177 (£. coli), 
pKT230 (gram-negative bacteria), pGVl 106 (gram-negative bacteria), pLAFRl (gram- 
negative bacteria), pME290 (non-£. coli gram-negative bacteria). pHVI4 (£. coli and 
Bacillus subtilis), pBD9 (Bacillus), pIJ61 (Streptomyces), pUC6 (Streptomyces), YIp5 

0 (Saccharomyces), a baculovirus insect cell system, YCpl9 (Saccharomyces). See, 

generally, M DNA Cloning": Vols. I & II, Glover et al. ed. IRL Press Oxford (1985) (1987) 
and; T. Maniatis et al. ("Molecular Cloning" Cold Spring Harbor Laboratory (1982). 

In some cases, it may be desirable to add sequences which cause the secretion of the 
polypeptide from the host organism, with subsequent cleavage of the secretory signal. 

5 Polypeptides can be expressed in host cells under the control of appropriate 

promoters. Cell-free translation systems can also be employed to produce such proteins 
using RNAs derived from the DNA constructs of the present invention. Appropriate 
cloning and expression vectors for use with prokaryotic and eukaryotic hosts are described 
by Sambrook, et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold 

0 Spring Harbor, N.Y., ( 1989), the disclosure of which is hereby incorporated by reference. 

Following transformation of a suitable host strain and growth of the host strain to 
an appropriate cell density, the selected promoter is induced by appropriate means (e.g., 
temperature shift or chemical induction) and cells are cultured for an additional period. 
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Cells are typically harvested by centrifugation, disrupted by physical or chemical 
means, and the resulting crude extract retained for further purification. 

Microbial cells employed in expression of proteins can be disrupted by any 
convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use 
5 of cell lysing agents, such methods are well known to those skilled in the art. 

Depending on the expression system and host selected, the polypeptide of the 
present invention may be produced by growing host cells transformed by an expression 
vector described above under conditions whereby the polypeptide of interest is expressed. 
The polypeptide is then isolated from the host cells and purified. If the expression system 
10 secretes the polypeptide into growth media, the polypeptide can be purified directly from 
the media. If the polypeptide is not secreted, it is isolated from cell lysates or recovered 
from the cell membrane fraction. Where the polypeptide is localized to the cell surface, 
whole cells or isolated membranes can be used as an assayable source of the desired gene 
product. Polypeptide expressed in bacterial hosts such as E. coli may require isolation from 
15 inclusion bodies and refolding. Where the mature protein has a very hydrophobic region 
which leads to an insoluble product of overexpression, it may be desirable to express a 
truncated protein in which the hydrophobic region has been deleted. The selection of the 
appropriate growth conditions and recovery methods are within the skill of the art. 

The polypeptide can be recovered and purified from recombinant cell cultures by 
methods including ammonium sulphate or ethanol precipitation, acid extraction, anion or 
cation exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography, hydroxylapatite chromatography and 
lectin chromatography. Protein refolding steps can be used, as necessary, in completing 
configuration of the mature protein. Finally, high performance liquid chromatography 
25 (HPLC) can be employed for final purification steps. 

Depending upon the host employed in a recombinant production procedure, the 
polypeptides of the present invention may be glycosylated or may be non-glycosylated. 
Polypeptides of the invention may also include an initial methionine amino acid residue. 

A "replicon" is any genetic element (e.g., plasmid, chromosome, virus) that 
functions as an autonomous unit of DNA replication in vivo; i.e., capable of replication 
under its own control. 

A "vector" is a replicon. such as a plasmid, phage, or cosmid, to which another 
DNA segment may be attached so as to bring about the replication of the attached segment. 
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A "double-stranded DNA molecule" refers to the polymeric form of 
deoxy ribonucleotides (bases adenine, guanine, thymine, or cytosine) in a double-stranded 
helix, both relaxed and supercoiled. This term refers only to the primary and secondary 
structure of the molecule, and does not limit it to any particular tertiary forms. Thus, this 
5 term includes double- stranded DNA found, inter alia, in linear DNA molecules (e.g., 

restriction fragments), viruses, plasmids, and chromosomes. In discussing the structure of 
particular double-stranded DNA molecules, sequences may be described herein according to 
the normal convention of giving only the sequence in the 5' to 3' direction along the 
nontranscribed strand of DNA (i.e., the strand having the sequence homologous to the 
10 mRNA). 

A DNA "coding sequence of ' or a "nucleotide sequence encoding" a particular 
protein, is a DNA sequence which is transcribed and translated into a polypeptide when 
placed under the control of appropriate regulatory sequences. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA 

15 polymerase in a cell and initiating transcription of a downstream (3* direction) coding 

sequence. For purposes of defining the present invention, the promoter sequence is bound 
at the 3* terminus by a translation start codon (e.g., ATG) of a coding sequence and extends 
upstream (5* direction) to include the minimum number of bases or elements necessary to 
initiate transcriptipn at levels detectable above background. Within the promoter sequence 

20 will be found a transcription initiation site (conveniently defined by mapping with nuclease 
SI), as well as protein binding domains (consensus sequences) responsible for the binding 
of RNA polymerase. Eukaryotic promoters will often, but not always, contain "TATA" 
boxes and "CAT" boxes. Prokaryotic promoters contain Shine-Dalgamo sequences in 
addition to the -10 and -35 consensus sequences. 

25 DNA "control sequences" refers collectively to promoter sequences, ribosome 

binding sites, polyadenylation signals, transcription termination sequences, upstream 
regulatory domains, enhancers, and the like, which collectively provide for the expression 
(i.e., the transcription and translation) of a coding sequence in a host cell. 

A control sequence "directs the expression" of a coding sequence in a cell when 

30 RNA polymerase will bind the promoter sequence and transcribe the coding sequence into 
mRNA, which is then translated into the polypeptide encoded by the coding sequence. 

A "host cell" is a cell which has been transformed or transfected, or is capable of 
transformation or transfection by an exogenous DNA sequence. 

18 
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A cell has been "transformed" by exogenous DNA when such exogenous DNA has 
been introduced inside the cell membrane. Exogenous DNA may or may not be integrated 
(covalently linked) into chromosomal DNA making up the genome of the cell. In 
prokaryotes and yeasts, for example, the exogenous DNA may be maintained on an 
5 episomal element, such as a plasmid. With respect to eukaryotic cells, a stably transformed 
or transfected cell is one in which the exogenous DNA has become integrated into the 
chromosome so that it is inherited by daughter cells through chromosome replication This 
stability is demonstrated by the ability of the eukaryotic cell to establish cell lines or Cones 
comprised of a population of daughter cell containing the exogenous DNA. 
10 A "clone" is a population of cells derived from a single cell or common ancestor by 

mitos,s. A "cell line" is a clone of a primary cell that is capable of stable growth in vitro for 
many generations. 

A "heterologous" region of a DNA construct is an identifiable segment of DNA 
within or attached to another DNA molecule that is not found in association with the other 
15 molecule in nature. 

-This invention is also related to the use of the polynucleotides of the invention for use as 
diagnostic reagents. Detecuon of a polynucleotide or po.ypeptide of the invention in a 
eukaryote. particularly a mammal, and especially a human, will provide a diagnostic method for 
dtagnosas of a disease. Eukaryotes (herein also "individual(s)"), particularly mammals and 
espeaally humans, infected with an organism comprising a polynucleotide of the invention may 
be detected at the nucleic acid level by a variety of techniques. 

Nucleic acids for diagnosis may be obtained from an infected individual's cells and 
trssues. such as bone, blood, muscle, cartilage, and skin. Genomic DNA may be used directly for 
detection or may be amplified enzymatically by using PGR or other amp.ificauon technique prior 
to analysis. RNA or cDNA may also be used in the same ways. Using amplification, 
characterization of the species and strain of pmkaryote present in an individual, may be made by 
an analysis of the genotype of the prokaryote gene. Deletions and insertions can be detected by a 
change in size of the amplified product in comparison to the genotype of a reference sequence 
Potnt mutations can be identified by hybridizing amplified DNA to labeled polynucleotide 
sequences of the invention. Perfectly matched sequences can be distinguished from mismatched 
duplexes by RNase digestion or by differences in melung temperatures. DNA sequence 
differences may also be detected by alterations in the electtophoretic mobility of the DNA 
fragment ui gels, with or without denaturing agents, or by direct DNA sequencing See e g. 
Myers et a... Science. 230: ,242 (1985). Sequence changes at specific .ocations also may be 
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revealed by nuclease protection assays, such as RNase and SI protection or a chemical cleavage 
method. See, e.g. f Cotton et al., Proc. Natl Acad. ScL, USA, 85: 4397-4401 
(1985). 

Cells carrying mutations or polymorphisms in the gene of the invention may also be 

5 detected at the DNA level by a variety of techniques, to allow for serotyping, for example. For 
example, RT-PCR can be used to detect mutations. It is particularly preferred to used RT-PCR 
in conjunction with automated detection systems, such as, for example, GeneScan. RNA or 
cDNA may also be used for the same purpose, PCR or RT-PCR. As an example, PCR primers 
complementary to a nucleic acid encoding a polypeptide of the invention can be used to identify 

10 and analyze mutations. These primers may be used for, among othe4r things, amplifying DNA 
isolated from a sample derived from an individual. The primers may be used to amplify the gene 
isolated from an infected individual such that the gene may then be subject to various techniques 
for elucidation of the DNA sequence. In this way, mutations in the DNA sequence may be 
detected and used to diagnose infection and to serotype and/or classify the infectious agent. 

15 The invention further provides a process for diagnosing, disease, preferably bacterial 

infections, more preferably infections by Streptococcus pneumoniae, comprising determining 
from a sample derived from an individual a increased level of expression of polynucleotide 
having a sequence set forth in the Sequence Listing or a sequence of the invention. 
Increased or decreased expression of a polynucleotide of the invention can be measured 

20 using any on of the methods well known in the art for the quantation of polynucleotides, 
such as, for example, amplification, PCR, RT-PCR, RNase protection. Northern blotting 
and other hybridization methods. 

In addition, a diagnostic assay in accordance with the invention for detecting over- 
expression of a polypeptide of the invention compared to normal control tissue samples may be 

25 used to detect the presence of an infection, for example. Assay techniques that can be used to 
determine levels of a polypeptide of the invention, in a sample derived from a host are well- 
known to those of skill in the art. Such assay methods include radioimmunoassays, competitive- 
binding assays, Western Blot analysis and ELISA assays. 

In accordance with yet a further aspect of the present invention, there is provided 

30 the use of a polypeptide of the invention for therapeutic or prophylactic purposes, for 
example, as an antibacterial agent or a vaccine. 

In accordance with another aspect of the present invention, there is provided the use 
of a polynucleotide of the invention for therapeutic or prophylactic purposes, in particular 
genetic immunisation. 

20 
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In accordance with yet another aspect of the present invention, there are provided 
inhibitors to such polypeptides, useful as antibacterial agents. In particular, there are 
provided antibodies against such polypeptides. 

Polypeptides of the invention may also be used to assess the binding of small molecule 
substrates and ligands in, for example, cells, cell-free preparations, chemical libraries, and natural 
product mixtures. These substrates and ligands may be natural substrates and ligands or may be 
structural or functional mimetics. See. e.g., Coligan et al.. Current Protocols in Immunology 
1(2): Chapter 5 (1991). 

The invention also provides a method of screening compounds to identify those which 
enhance (agonist) or block (antagonist) the action of polypeptides or polynucleotides of the 
invention, particularly those compounds that are bacteriostatic and/or bacteriocidal. The method 
of screening may involve high-throughput techniques. For example, to screen for agonists or 
antagoists. a synthetic reaction mix. a cellular compartment, such as a membrane, cell envelope 
or cell wall, or a preparation of any thereof, comprising a polypeptide of the invention and a 
15 labeled substrate or ligand of such polypeptide is incubated in the absence or the presence of a 
candidate molecule that may be an agonist or antagonist of a polypeptide of the invention. The 
ability of the candidate molecule to agonize or antagonize a polypeptide of the invention is 
reflected in decreased binding of the labeled ligand or decreased production of product from such 
substrate. Molecules that bind gratuitously, U. , without inducing the effects of a polypeptide of 
the invention are most likely to be good antagonists. Molecules that bind well and increase the 
rate of product production from substrate are agonists. Detection of the rate or level of production 
of product from substrate may be enhanced by using a reporter system. Reporter systems that 
may be useful in this regard include but are not limited to colorimetric labeled substrate 
converted into product, a reporter gene that is responsive to changes in polynucleotide or 
25 polypeptide activity, and binding assays known in the art. 

Another example of an assay for antagonists of a polypeptide of the invention is a 
competitive assay that combines such polypeptide and a potential antagonist with polypeptide- 
bmding molecules, recombinant polypeptide-binding molecules, natural substrates or ligands or 
substrate or ligand mimetics, under appropriate conditions for a competitive inhibition assay 
Polypeptides of the invention can be labeled, such as by radioactivity or a colorimetric 
compound, such that the number of such polypeptide molecules bound to a binding molecule or 
converted to product can be determined accurately to assess the effectiveness of the potential 
antagonist. 
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Potential antagonists include small organic molecules, peptides, polypeptides and 
antibodies that bind to a polynucleotide or polypeptide of the invention and thereby inhibit or 
extinguish its activity. Potential antagonists also may be small organic molecules, a peptide, a 
polypeptide such as a closely related protein or antibody that binds the same sites on a binding 
5 molecule, without inducing an activities of a polypeptide of the invention, thereby preventing the 
action of such polypeptide by excluding it from binding. 

Potential antagonists include a small molecule that binds to and occupies the binding 
site of the polypeptide thereby preventing binding to cellular binding molecules, such that 
normal biological activity is prevented. Examples of small molecules include but are not limited 
10 to small organic molecules, peptides or peptide-like molecules. Other potential antagonists 
include antisense molecules (see Okano. J. Neurochem. 56: 560 (1991); 
OUGODEOXYNUCLEOTIDES AS ANTISENSE INHIBITORS OF GENE EXPRESS/ON CRC 
Press, Boca Raton, FL (1988), for a description of these molecules). Preferred potential 
antagonists include compounds related to and variants of the polypetides of the invention. 
15 Each of the DNA sequences provided herein may be used in the discovery and 

development of antibacterial compounds. The encoded protein, upon expression, can be 
used as a target for the screening of antibacterial drugs. Additionally, the DNA sequences 
encoding the amino terminal regions of the encoded protein or Shine-Delgamo or other 
translation facilitating sequences of the respective mRNA can be used to construct antisense 
20 sequences to control the expression of the coding sequence of interest. 

The invention also provides the use of the polypeptide, polynucleotide or inhibitor 
of the invention to interfere with the initial physical interaction between a pathogen and 
mammalian host responsible for sequelae of infection. In particular the molecules of the 
invention may be used: in the prevention of adhesion of bacteria, in particular gram positive 
25 bacteria, to mammalian extracellular matrix proteins on in-dwelling devices or to 
extracellular matrix proteins in wounds; to block protein-mediated mammalian cell invasion 
by, for example, initiating phosphorylation of mammalian tyrosine kinases (Rosenshine et 
ai. y Infect, Immun. 60:221 1 (1992); to block bacterial adhesion between mammalian 
extracellular matrix proteins and bacterial proteins that mediate tissue damage and; to block 
30 the normal progression of pathogenesis in infections initiated other than by the implantation 
of in-dwelling devices or by other surgical techniques. 

The antagonists and agonists of the invention may be employed, for instance, to inhibit 
and treat otitis media, conjunctivitis, pneumonia, bacteremia, meningitis, sinusitis, pleural 
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empyema and endocarditis, and most particularly meningitis, such as for example infection of 
cerebrospinal fluid. 

Another aspect of the invention is a pharmaceutical composition comprising the 
above polypeptide, polynucleotide or inhibitor of the invention and a pharmaceutical ly 
5 acceptable carrier. 

In a particular aspect the invention provides the use of an inhibitor of the invention 
as an antibacterial agent. 

The invention further relates to the manufacture of a medicament for such uses. 

The polypeptide may be used as an antigen for vaccination of a host to produce 
10 specific antibodies which have anti-bacterial action. This invention also contemplates the 
use of the DNA encoding the antigen as a component in a DNA vaccine as discussed more 
fully below. 

The polypeptides or cells expressing them can be used as an immunogen to produce 
antibodies thereto. These antibodies can be. for example, polyclonal or monoclonal 
antibodies. The term antibodies also includes chimeric, single chain, and humanized 
antibod,es, as well as Fab fragments, or the product of an Fab expression library. Various 
procedures known in the art may be used for the production of such antibodies and 
fragments. 

Antibodies generated against the polypeptides of the present invention can be 
obtained by direct injection of the polypeptides into an animal or by administering the 
polypeptides to an animal, preferably a nonhuman. The antibody so obtained will then bind 
the polypeptides itself. In this manner, even a sequence encoding only a fragment of the 
polypeptides can be used to generate antibodies binding the whole native polypeptides. 
Such antibodies can then be used to isolate the polypeptide from tissue expressing that 
25 polypeptide. 

Polypeptide derivatives include antigenically or immunologically equivalent 
derivatives which form a particular aspect of this invention. 

The term 'antigenically equivalent derivative" as used herein encompasses a 
polypeptide or its equivalent which will be specifically recognised by certain antibodies 
which, when raised to the protein or polypeptide according to the present invention, 
interfere with the interaction between pathogen and mammalian host. 

The term 'immunologically equivalent derivative' as used herein encompasses a 
peptide or its equivalent which when used in a suitable formulation to raise antibodies in a 
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vertebrate, the antibodies act to interfere with the interaction between pathogen and 
mammalian host. 

In particular derivatives which are slightly longer or slightly shorter than the native 
protein or polypeptide fragment of the present invention may be used. In addition, 
5 polypeptides in which one or more of the amino acid residues are modified may be used. 
Such peptides may, for example, be prepared by substitution, addition, or rearrangement of 
amino acids or by chemical modification thereof. All such substitutions and modifications 
are generally well known to those skilled in the art of peptide chemistry. 

The polypeptide, such as an antigenically or immunologically equivalent derivative 
10 or a fusion protein thereof is used as an antigen to immunize a mouse or other animal such 
as a rat or chicken. The fusion protein may provide stability to the polypeptide. The 
antigen may be associated, for example by conjugation , with an immunogenic carrier 
protein for example bovine serum albumin (BSA) or keyhole limpet haemocyanin (KLH). 
Alternatively a multiple antigenic peptide comprising multiple copies of the protein or 
15 polypeptide, or an antigenically or immunologically equivalent polypeptide thereof may be 
sufficiently antigenic to improve irnmunogenicity so as to obviate the use of a carrier. 

For preparation of monoclonal antibodies, any technique which provides antibodies 
produced by continuous cell line cultures can be used. Examples include the hybridoma 
technique (Kohler and Milstein, Nature . 256:495-497(1975)), the trioma technique, the 
20 human B-cell hybridoma technique (Kozbor et al.. Immunolog y Today 4:72(1983)), and the 
EBV-hybridorna technique to produce human monoclonal antibodies (Cole, et al., 1985, in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96): 

Techniques described for the production of single chain antibodies (U.S. Patent 
4,946,778) can be adapted to produce single chain antibodies to immunogenic polypeptide 
25 products of this invention. 

Using the procedure of Kohler and Milstein ( supra . (1975)), antibody-containing 
cells from the immunised mammal are fused with myeloma cells to create hybridoma cells 
secreting monoclonal antibodies. 

The hybridomas are screened to select a cell line with high binding affinity and 
30 favorable cross reaction with other Streptococcal species using one or more of the original 
polypeptide andVor the fusion protein. The selected cell line is cultured to obtain the desired 
Mab. 

Hybridoma cell lines secreting the monoclonal antibody are another aspect of this 
invention. 
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Alternatively phage display technology could be utilised to select antibody genes 
with binding activities towards the polypeptide either from repertoires of PCR amplified v- 
genes of lymphocytes from humans screened for possessing anti-Fbp or from naive libraries 
(McCafferty. J. et al., £Jatme 348:552-554( 1990), and Marks. J. ex al., Biotechnnlnpv 
5 10:779-783( 1992)). The affinity of these antibodies can also be improved by chain 
shuffling (Clackson, T. et al.. Nature 352:624-628(1991)). 

The antibody should be screened again for high affinity to the polypeptide and/or 
fusion protein. 

As mentioned above, a fragment of the final antibody may be prepared. 
1 0 The antibody may be either intact antibody of M r approx 1 50,000 or a derivative of 

it. for example a Fab fragment or a Fv fragment as described in Skerra, A and Pluckthun, 
A., Ssieoce 240: 1038-1040 (1988). If two antigen binding domains are present each 
domain may be directed against a different epitope - termed 'bispecific' antibodies. 

The antibody of the invention may be prepared by conventional means for example 
by established monoclonal antibody technology (Kohler. G. and Milstein, C. ( supra . (1975) 
or using recombinant means e.g. combinatorial libraries, for example as described in Huse. 
W.D. «a/., Scjsacs 246:1275-1281 (1989). 

Preferably the antibody is prepared by expression of a DNA polymer encoding said 
antibody in an appropriate expression system such as described above for the expression of 
20 polypeptides of the invention. The choice of vector for the expression system will be 

determined in part by the host, which may be a prokaryotic cell, such as E. coli (preferably 
strain B) or Streptomyces sp. or a eukaryotic cell, such as a mouse CI 27. mouse myeloma, 
human HeLa. Chinese hamster ovary, filamentous or unicellular fungi or insect cell. The 
host may also be a transgenic animal or a transgenic plant (for example, as described in 
Hiatt. A. etal., Haffice 340:76-78(1989). Suitable vectors include plasmids, bacteriophages, 
cosmids and recombinant viruses, derived from, for example, baculoviruses and vaccinia. 

The Fab fragment may also be prepared from its parent monoclonal antibody by 
enzyme treatment, for example using papain to cleave the Fab portion from the Fc portion. 

Preferably the antibody or derivative thereof is modified to make it less 
immunogenic in the patient. For example, if the patient is human the antibody may most 
preferably be 'humanised'; where the complimentarity determining region(s) of the 
hybridoma-derived antibody has been transplanted into a human monoclonal antibody , for 
example as described in Jones, P. et al.. Uatmsz.321 :522-525 (1986), or Tempest etal., 
Biotechnnlnpy 9:266-273 ( 1 99 1 ). 

25 
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The modification need not be restricted to one of 'humanisation' ; other primate 
sequences (for example Newman, R. et al % Biotechnology 10:1455-1460 (1992)) may also 
be used. 

The humanised monoclonal antibody, or its fragment having binding activity, form 
5 a particular aspect of this invention. 

This invention provides a method of screening drugs to identify those which 
interfere with the proteins selected as targets herein, which method comprises measuring the 
interference of the activity of the protein by a test drug. For example if the protein selected 
has a catalytic activity, after suitable purification and formulation the activity of the enzyme 

10 can be followed by its ability to convert its natural substrates. By incorporating different 

chemically synthesised test compounds or natural products into such an assay of enzymatic 
activity one is able to detect those additives which compete with the natural substrate or 
otherwise inhibit enzymatic activity. 

The invention also relates to inhibitors identified thereby. 

1 5 The use of a polynucleotide of the invention in genetic immunisation will 

preferably employ a suitable delivery method such as direct injection of plasmid DNA into 
muscles (Wolffs at, Hum. Mol. Genet. 1:363 (1992); Manthorpe et at:, H^m Gene Ther 
4:419 (1963)), delivery of DNA complexed with specific protein carriers ( Wu et a/., L 
Biol. Chem . 264:16985 ( 1989)), coprecipitation of DNA with calcium phosphate 

20 (Benvenisty & Reshef, Proc. Nat 1 ! Acad. Sci. USA . 83:9551 (1986)), encapsulation of DNA 
in various forms of liposomes (Kaneda et a/., Science 243:375 (1989)), particle 
bombardment (Tang et aL, Nature 356:152 (1992)); Eisenbraun et al y PNA Cell 3iol 
12:79 1 ( 1993)) and in vivo infection using cloned retroviral vectors (Seeger et a/., PrQC- 
Nat'l. Acad. Sci. USA 81 :5849 (1984)). Suitable promoters for muscle transfection include 

25 CMV, RSV, SRa, actin, MCK, alpha globin, adenovirus and dihydrofolate reductase. 

In therapy or as a prophylactic, the active agent i.e., the polypeptide, polynucleotide 
or inhibitor of the invention, may be administered to a patient as an injectable composition, 
for example as a sterile aqueous dispersion, preferably isotonic. 

Alternatively the composition may be formulated for topical application 

30 for example in the form of ointments, creams, lotions, eye ointments, eye drops, ear drops, 
mouthwash, impregnated dressings and sutures and aerosols, and may contain appropriate 
conventional additives, including, for example, preservatives, solvents to assist drug 
penetration, and emollients in ointments and creams. Such topical formulations may also 
contain compatible conventional carriers, for example cream or ointment bases, and ethanol 
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or oleyl alcohol for lotions. Such carriers may constitute from about 1 % to about 98% by 
weight of the formulation; more usually they will constitute up to about 80% by weight of 
the formulation. 

For administration to human patients, it is expected that the daily dosage level of 
5 the active agent will be from 0.01 to 10 mg/kg. typically around 1 mg/kg. The physician in 
any event will determine the actual dosage which will be most suitable for an individual 
patient and will vary with the age, weight and response of the particular patient. The above 
dosages are exemplary of the average case. There can, of course, be individual instances 
where higher or lower dosage ranges are merited, and such are within the scope of this 
0 invention. 

A vaccine composition is conveniently in injectable form. Conventional adjuvants 
may be employed to enhance the immune response. 

A suitable unit dose for vaccination is 0.5-5 ug/kg of antigen, and such dose is 
preferably administered 1-3 times and with an interval of 1-3 weeks. 

Within the indicated dosage range, no adverse toxicologicals effects are expected 
with the compounds of the invention which would preclude their administration to suitable 
patients. 

In order to facilitate understanding of the following example certain frequently 
occurring methods and/or terms will be described. 

"Plasmids" are designated by a lower case p preceded and/or followed by capital 
letters and/or numbers. The starting plasmids herein are either commercially available 
publicly available on an unrestricted basis, or can be constructed from available plasmids in 
accord with published procedures. In addition, equivalent plasmids to those described are 
known in the art and will be apparent to the ordinarily skilled artisan. 

"Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction 
enzyme that acts only at certain sequences in the DNA. The various restriction enzymes 
used herein are commercially available and the.r reaction conditions, cofactors and other 
requirements were used as would be known to the ordinarily skilled artisan. For analytical 
purposes, typically 1 ug of plasmid or DNA fragment is used with about 2 units of enzyme 
» about 20 M | of buffer solution. For the purpose of isolating DNA fragments for plasmid 
construction, typically 5 to 50 M g of DNA are digested with 20 to 250 units of enzyme in a 
larger volume. Appropriate buffers and substrate amounts for particular restriction enzymes 
are specified by the manufacturer. Incubation times of about 1 hour at 37° C are ordinarily 
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used, but may vary in accordance with the supplier's instructions. After digestion the 
reaction is electrophoresed directly on a polyacrylamide ge! to isolate the desired fragment. 

Size separation of the cleaved fragments is performed using 8 percent 
polyacrylamide gel described by Goeddel, D. et at, (1980) Nucleic Acids Res. t 8:4057. 
5 "Oligonucleotides" refers to either a single stranded polydeoxynucleotide or two 

complementary polydeoxynucleotide strands which may be chemically synthesized. Such 
synthetic oligonucleotides have no 5' phosphate and thus will not ligate to another 
oligonucleotide without adding a phosphate with an ATP in the presence of a kinase. A 
synthetic oligonucleotide will ligate to a fragment that has not been dephosphorylated. 

10 "Ligation" refers to the process of forming phosphodiester bonds between two 

double stranded nucleic acid fragments (Maniatis, T., et al. % supra., p. 146). Unless 
otherwise provided, ligation may be accomplished using known buffers and conditions with 
10 units to T4 DNA ligase ("ligase") per 0.5 jig of approximately equimolar amounts of the 
DNA fragments to be ligated. 

1 5 Example 1 

Isolation of DNA coding for a virulence gene in Streptococcus pneumoniae 

As mentioned above each of the DN As disclosed herein by virtue of the fact that it 
includes an intact open reading frame is useful to a greater or lesser extent as a screen for 
identifying antimicrobial compounds. A useful approach for selecting the preferred DNA 

20 sequences for screen development is evaluation by insertion-duplication mutagenesis. This 
system disclosed by Morrison et ah, J. Bacteriol . 159:870 (1984), is applied as follows. 

Briefly, random fragments of Streptococcus pneumoniae, strain 0100993 DNA are 
generated enzymatically (by restriction endonuclease digestion) or physically (by sonication 
based shearing) followed by gel fractionation and end repair employing T4 DNA 

25 polymerase. It is preferred that the DNA fragments so produced are in the range of 200-400 
base pairs, a size sufficient to ensure homologous recombination and to insure a 
representative library in E.coti. The fragments are then inserted into appropriately tagged 
plasmids as described in Hensel £t al. f Science 269: 400-403(1995). Although a number of 
plasmids can be used for this purpose, a particularly useful plasmid is pJDC9 described by 

30 Pearce et al., Mot. Microbiol . 9:1037 (1993) which carries the erm gene facilitating 
erythromycin selection in either E. coli or S. pneumoniae previously modified by 
incorporation of DNA sequence tags into one of the polylinker cloning sites. The tagged 
plasmids are introduced into the appropriate 5. pneumoniae strain selected, inter alia, on the 
basis of serotype and virulence in a murine model of pneumococcal pneumonia. 
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It is appreciated that a seventeen amino acid competence factor exists (Havastein « 
al., Etoc. MaO- 6sM. SfiL USA 92: 1 1 140-44 (1995)) and may be usefully employed in this 
protocol to increase the transformation frequencies. A proportion of transformants are 
analysed to verify homologous integration and as a check on stability. Unwanted levels of 
5 reversion are minimized because the duplicated regions will be short (200-400 bp), however 
if significant reversion rates are encountered they may be modulated by maintaining . 
antibiotic selection during the growth of the transformants in culture and/or during growth 
in the animal. 

The 5. pneumoniae transformants are pooled for inoculation into mice, eg., Swiss 
10 and/or C57B1/6. Preliminary experiments are conducted to establish the optimum 
complexity of the pools and level of inoculum. A particularly useful model has been 
described by Veber et al. (J. Amimirmhin t Q iemat hg X .32:432 (1993) in which 10* cfu 
inocula sizes are introduced by mouth to the trachea. Strain differences are observed with 
respect to onset of disease e.g.,3-4 days for Swiss mice and 8- 1 0 days for C57B 1/6. 
.15 Infection yields in the lungs approach 10* cfu/lung. IP administrate is also possible when 
genes mediating blood stream infection are evaluated. Following optimization of 
parameters of the infection model, the mutant bank normally comprising se veral thousand 
strains is subjected to the virulence test. Mutants with attenuated virulence are identified by 
hybridization analysis using the labelled tags from the "input" and "recovered" pools as 
probes as described in Hensel * al., gcjepce 269: 400-403( 1 995). S. pneumoniae DNA is 
colony blotted or dot blotted. DNA flanking the integrated plasmid is cloned by plasmid 
rescue in E. coli (Morrison et al.. UKIfirifll. 159:870 (1984)) and sequenced. Following 
sequencing, the DNA is compared to the nucleotide sequences g,ven herein and the 
appropriate ORF is identified and function confirmed for example by knock-ou, studies 
Expression vectors providing the selected protein are prepared and the protein is configured 
m an appropriate screen for the identification of anti-microbial agents. Alternatively, 
genomic DNA libraries are probed with restriction fragments flanking the integrated 
Plasmid to isolate full-length cloned virulence genes whose function can be confirmed by 
"knock-out" studies or other methods, which are then expressed and incorporated into a 
30 screen as described above. 

The individual full length sequences given herein are summarized in the following 
Tables (Table 1 and Table 2). Under the column in Table 1 labeled "Identity " there is the 
deduced identity of each open reading frame of the invention determined using Blastp 
and/or MPSearch. The ORF# column indicates whether the polynucleotide encoding each 
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ORF encodes more than one ORF. For example, SEQ ID NO:224 has an ORF# 1 but no 
#2; thus the polynucleotide encodes this ORF#l, but no other ORF was detected. On the 
other hand, SEQ ID NO:225 and 226 have ORF#s 1 and 2 respectively, indicating that they 
were both encoded by the same polynucleotide. This can also be seen in Table 2 where the 
5 polynucleotide of SEQ ID NO: 2 encodes the ORFs of SEQ ID NOS: 225 and 226. Table 1 
also shows the start ("START" column) and stop ("STOP" column) codons for each ORF 
and their positions in the encoding polynucleotide sequence. The SEQ ID NOS of the 
polypeptides of this table are linked to both a deduced identity in this table and a 
polynucleotide sequence in Table 2 which encodes each polypeptide. The "Direction" 
10 column in Table 1 shows the direction of the ORF encoding each poypetide in this table. 
"Forward" denotes the sense orientation and "Reverse" denotes the antisense orientation of 
the ORF. 
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TABLE 1 









Codon 




Position 1 




SEQ ID NO: 


Identity 


ORF# 


Start 


Stop 


Start 


St on 


DirectX n 


224. 


Phosphate Transport ATP-Binding Protein 
PSTB - Escherichia 


1 


-CAT 


TCA- 


2 




Reverse 


225. 


probable ATP binding protein - Bacillus 
subtilis 


1 


-CAT 


TCA- 


38 


7« 1 

/ O I 


Reverse 


226. 


Nopaline Transpoa System Permease 
Protein Nocm. - Agrobacterium Tumefaciens 


2 


-CAG 


TTA- 


78 1 


1 J 1 Z 


Reverse 

i 


227. 


Aspanate Aminotransferase A (EC 2.6. 1 1 ) 


1 


ATG 


TAG 


480 


67 1 


Forward 




(Transaminase A) (ASPAT.) - Rhizobium 














Meiiloti. 














228. 


ISL2 protein - Lactobacillus heiveticus 


1 


ATG 


TAA 


549 


947 


Forward 


229. 


ISL2 protein - Lactobacillus heiveticus 


2 


ATG 


TAG 


1889 


1353 


Forward 


230. 


Unknown 


3 


ATG 


TGA 


1421 


1570 


Forward 


231. 


Unknown 


2 


-CAT 


CTA- 


i i i 
i i i 


4 1 3 


Reverse 


232. 


Unknown 


I 


ATG 


TGA 


t jj 




Forward 


233. 


Acyl Carrier Protein (ACP.) - Rhizobium 
Meiiloti. 


1 


-CAT 


CTA- 


o / J 




Reverse 


234. 


Unknown 


1 


-CAT 


TTA- 


15 


137 


Reverse 


235. 


Unknown . 


2 


-CAT 


CTA- 


OO J 




Reverse 


236. 


Sulfate Transport System Permease Protein 
CYST (Fragment). - Synechocystis SP. 
(Strain PCC 6803). 


2 


-CAT 


TTA- 


336 


689 


Reverse 


237. 


probable transposase (insertion sequence 
IS86I) - Streptococcus agalactiae (str ain 
COH-H 


-» 


-CAT 


TCA- 


149 


454 


Reverse 


238. 


Unknown 


2 


-CAG 


TTA- 


567 


851 


Reverse 


239. 


"PTS SYSTEM 


J 


ATG 


TAA 


49 


477 


Forward 


240. 


M PTS SYSTEM 




-CAT 


TTA- 


151 


789 


Reverse 


241. 


nitrogen fixation protein (nifS) homolog - 
Haemophilus influenzae (strain Rd KW 20) 


1 


-CAT 


TCA- 


1 


207 


Reverse 


242. 


cellobiose phosphotransferase system celA - 
Bacillus stearothermophilus 


1 


-CAT 


TCA- 


84 


431 


Reverse 


243. 

244. _J 


surface protein PspA - Streptococcus 

pneumoniae 

Unknown 


1 


ATG 


TAA 


22 


321 


Forward 


245. 


ATP-dependent Clp proteinase (EC 3.4.21 .-) 
chain clpL - Lactococcus lactis subs p. lactis 


2 
1 


ATG 
ATG 


TAA 
TGA 


272 
83 


469 
760 


Forward 
Forward 
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plasmtd pUCL22 












246. 


Beta-Glucosidase (EC3.2.1.21) 
(Gentiobiase) (Cellobiase) ) (Beta-D- 
Glucosidc GL Ucohydroiasc) (Amygdalase). 
- Bacillus Subtilis. 


1 


-CAT 


1 LA* 


oj 


599 


Reverse 


247. 


Glycine Betaine/L -Proline Transpon ATP - 
Bindine Protein Prov. - Escherichia Col. I. 


2 


ATG 


TAA 


159 


887 


Porwarrl 

L Ul Vt CXI u 


248. 


Unknown 


3 


-CAT 


11 A- 


1 102 


1754 


Reverse 


249. 


30S Ribosomal Protein SI 1 (BSl 1). - 
Bacillus Subtilis. 


1 


ATG 


TUA 


15 




Fnnif g rr~i 

rorwaiu 


250. 


DNA - Directed RNA Polymerase Alpha 
Chain (EC 2.7,7.6) (Transcriptase Alpha 
Chain). - Bacillus Subtilis. 


2 


ATG 


TAA 


282 


1217 


Forward 


251. 


Peptide Chain Release Factor 3 (RF-3). - 
Bacteroides Nodosus f Dichelobacter Nod 
Osus>. 


1 


ATG 






667 


For\A/2irH 


252. 


Unknown 


! 


ATG 


TAG 


267 


353 


Forward 


253. 


Riboflavin Synthase Alpha Chain (EC 
2.5.1.9). - Bacillus Subtilis. 


-> 


-CAT 


CTA- 


213 


662 


Reverse 


254. 


Unknown 


3 


-CAT 


TTA- 


833 


1045 


Reverse 


255. 


Unknown 


! 


-CAT 


TTA- 


83 


424 


Reverse 


256. 


mesi protein - Leuconostoc mesenteroides 


2 


ATG 


TAG 


448 


1302 


Forward 


257. 


Transacetyiase BMTD (EC 2.3.1.-). - 
Bacillus Subtilis. 


I 


CTG 


TAG 


3 


320 


Forward 
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278. \ 


Jnknown 


1 


-CAT < 


rrA- 


165 : 


535 I 


Reverse 


279. 1 
( 
1 


lipoprotein Signal Peptidase <EC 3.4.23.36) 
Prolipoprotein Signal Peptidase) (Signal 
Peptidase II) (Spase 11). - Staphylococcus 
Aureus. 


1 


ATG ' 


rAA t 


56 


517 I 


"orward 


280. [ 


Unknown 


! 


ATG ' 


TAA 


214 


534 


Forward 


281. 


Alpha- Acetoiactaie Decarboxylase (EC 
4.1.1.5). - Bacillus Subtil is. 


1 


-CAT < 


CTA- 


104 


445 


Reverse 


282. 


Dihydrodipicolinace Synthase 9EC 4.2.1.52) 
(DHDPS). - Bacillus Subtil is. 


3 


-CAT 


TCA- 


675 


884 


Reverse ■ 


283. 


Polyribonucleotide Nucleotidyltransferase 
(EC 2.7.7.8) (Polynucleotide Phosphor 
Ylase) (Pnpase). - Photorhabdus 
Luminescens. 


1 


-CAT 


TCA- 


1 


855 


Reverse 


284. 


Single-Strand Binding Protein (SSB) (Helix- 
Destabilizine Protein). - Bacillus S Ubtiiis 


1 


-CAT 


TTA- 


128 


598 


Reverse 


285. 


ATP-Dependent CLP Protease ATO- 
Bindinp Subunu CLPX. - Escherichia Coli. 


2 


-CAT 


CTA- 


195 


482 


Reverse 


286. 


ATP-Dependent CLP Protease ATO- 
Binding Subunit CLPX. - Escherichia Coli 


4 


-CAG 


TCA- 


676 


990 


Reverse 


287. 


N-^-Phosphoribosyl) Anthranilate 
Isomerase (EC 5.3.1 .24) (PRAJ). - 
Lactococcus Lactis (Subsp. Lactis) 
(Streptococcus Lactis). 


1 


ATG 


TAA 


83 


712 


Forward 


288. 


30S Ribosomal Protein S2. - Escherichia 
Coli. 


1 


ATG 


TGA 


277 


591 


Forward 


289 


30S Ribosomal Protein S2. - Escherichia 
Coli. 


3 


ATG 


TGA 


918 


1064 


i 

1 Forward 

i 


290. 


Possible beta-eaiactosidase precursor 


1 


-CAT 


CTA- 


120 


545 


1 Reverse 


291. 


Unknown 


1 


-CAT 


TTA- 


77 


283 


I Reverse 


292. 


Unknown 


1 


-CAT 


TTA- 


1 


198 


1 Reverse 


293. 


'Thospho-2-Dehydro-3-Deoxyheptonate 
Aldolase 


1 


-CAT 


TTA- 


217 


690 


Reverse 
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SEQ ID NO; 



Identity 



ORF# 



Codon 
Start 



Stop 



Position 
Start 



Stop 



Direction 
Reverse 



295 
296. 



Unknown 



Unknown 



-CAT TTA~ 



-CAT 



TCA~ 



429 



599 



936 
480 



297. 



Sorbitol Dehydrogenase (EC 1.1.1.14) (L- 
Iditol 2-Dehydrogenase). - Bacillus Sub 
Tilis. 



ATG 



TAA 



64 



EBG Qperon Repressor - Escherichia Coli 



-CAT 



CTA- 



Forward 



301. 



302. 



303. 



304. 



305 



306. 
307 



308. 



ceilobiose phosphotransferase system celA - 
Bacillus stearothermop hilu* 
Unknown 

Adenylosuccinate Lyase (EC 4.3.2.2) 
(Adenylosuccinate) (ASL). - Bacillus Subtil 

Is. 



-CAT 



TTA- 



249 



ATP-Binding Protein BEXA. - Haemophilus 
Influenzae. 



L-Lactate Dehydrogenase (Cytochrome) (EC 
1.1.2.3). - Escherichia Coli. 



Phosphate Transport System Permease 
Protein PSTC. - Escherichia r^li 



Sulfate Transport ATP-Binding Protein 
Cysa. - Synechococcus SP. (Strain PCC 794 
2) (Anacvstis Nidulans R2V 



Unknown 



Unknown 



Unknown 



Phosphonbosy I formy Iglycinamidine Cyclo- 
Ligase (EC 6.3.3. 1 ) (AIRS) (Phosphoribo 
Syl-Arninoimidazole Synthetase) (AIR 
ithase ). - Bacillus Subtilis 




ATG 



ATG 



ATG 



-CAT 



TGA 



TAA 



TGA 



CTA- 



•CAT 



ATG 



-CAT 



TTA- 
TCA- 



TAG 



TCA- 



111 



337 



507 



110 



838 



282 



I 108 

331 



1060 
566 



Reverse 
Reverse 




404 



Forward 



507 



1070 



838 



1077 



749 



Forward 



Forward 



Reverse 



Reverse 



Reverse 



1290 Forward 



477 Reverse 
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318. 


"PTS Svstem 


2 


-CAT 


TCA- 


166 


465 


Reverse 


319. 


L-Fucosc isomerase (EC 5.3. 1 .-) - 
Escherichia Coli. 


1 


-CAT 


CTA- 


9 


482 


Reverse 


320. 


Unknown 


2 


-CAT 


TTA- 


495 


650 


Reverse 


321. 


Unknown 


1 


ATG 


TGA 


130 


231 


Forward 


322. 


3-Oxoacyl-[Acyl-Carrier Protein) Reductase 
(EC 1. 1.1. 100) (3-Ketoacyl- Acy 1 Car Rier 
Protein Reductase). - Escherichia Coli. 


1 


-CAT 


TCA- 


51 


200 


Reverse 


323. 


3-Oxoacyi-lAcyl-Carrier Protein! Reductase 
(EC 1.1.1.100) (3-Ketoacyl- Acy 1 Car Rier 
Protein Reductase). - Escherichia Coli. 


2 


-CAT 


TCA- 


231 


614 


Reverse 


324. 


2-Isopropylmalate Synthase (EC 4.1.3.12) 
(Alpha-lsopropylmaJate Synthase) (Alph A- 
IPM Synthetase). - Lactococcus Lactis 
(Subsp. Lactis) (Streptococcus Lactis). 


1 


-CAT 


TTA- 


31 


231 


Reverse 


325. 


2-Isopropylmaiate Synthase (EC 4. 1 .3. 12) 
(Alpha-isopropylmalate Synthase) (Alph A- 
EPM Synthetase). - Lactococcus Lactis 
(Subsp. Lactis) (Streptococcus Lactis). 


2 


-CAT 


TCA- 


191 


334 


Reverse 


326. 


Unknown 


3 


-CAT 


TCA- 


309 


452 


Reverse 


327. 


2-Isopropylrnalate Synthase (EC 4.1.3. 12) 
(Alpha-Isopropylmaiaie Synthase) (Alph A- 
IPM Synthetase). - Lactococcus Lactis 
(Subsp. Lactis) (Streptococcus Lactis). 


4 


-CAT 


TTA- 


495 


I 127 


Reverse 


328. 


Unknown 


2 


-CAT 


TCA- 


170 


469 


Reverse 


329. 


Unknown 


I 


ATG 


TAG 


105 


473 


Forward 


330. 


StrscaaNcbi gi: 310629NCBI gi: 473 - 
Streptococcus eordonii (strain PK488^ DNA 


1 


-CAG 


CTA- 


81 . 


665 


Reverse 
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SEQ ID NO: 

[331. 



332. 



333. 



334. 



335. 



m6. 



337. 



338. 



339. 



340. 



341. 



342. 



£46. 



347. 



1348. 



Identity 

Unknown 



lysyl arninopeptidase (EC 3.4.1 1.15) 
precursor - Lactococcus laccis 



Indole-3-GIycerol Phosphate Synthase (EC 
4.1.1.48) (IGFS). - Lactococcus Lactis 
(SUBSP. La ctis) (Streptococcus Lactis) 
Anthraniiate Phosphoribosy J transferase (EC 
2.4.2. 18).-Lactococcus Lactis (Sub SP. 
Lactis) (Streptococcus Lactis V 



I Cod n 



ORF#IStart 



Tagatose-6-Phosphate Kinase (EC 2.7. 1.-) 
(Phosphotagatokinase). - Lactococcus L 
Actis (Subs p. Lactis) (Streptococcus Lacu*\ 



Unknown 



[Unknown 



I Unknown 



Acetolactate Synthase Large Subunit (EC 
4. 1.3.18) (AHAS) (Acetphydroxy-Acid Syn 
thase Large Subunit) (ALS). - Lactococcus 
Lactis (SUBSP. Lactis) (Streptococc us 
Lactis). 



Penicillin-Binding Proteins 1A/IB. 
Bacillus Subtilii; 



Unknown 



Anthraniiate Phosphoribosy (transferase (EC 
2.4.2.18). - Lactococcus Lactis (SUB SP. 
Lactis) (Streptococcus Lactis). 



E protein - Lactococcus Lactis 
Unknown 
Unknown 



Unknown 



Unknown 



Unknown 



Unknown 



Unknown 
NIFS Protein Homolog (Fragment). - 
Lactobacillus Delbnieckii (SUBSP. 
Bujeancus). 

Ornithin e Carbamoyl transferase 
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ATG 



-CAT 



-CAT 



-CAA 



TCA- 



ATG 



ATG 



-CAT 



ATG 



ATG 



TCA- 



TCA- 



TGA 



Position 



Star t 

228 



107 



127 



366 



42 



TGA 



TTA- 



73 



32 



TAG 



TGA 



ATG 



-CAT 



TAG 



TCA- 



306 



59 



535 



165 



ATG 

-CAT 

-CAT 



ATG 



-CAT 



TAA 

CTA~ 

TTA- 



TAA 



124 



34 
377 



-CAT 



-CAT 
ATG 



TTG 



CTA- 



CTA- 



159 



315 



468 



CTA- 



TTA" 
TAG 



482 



766 



369 



641 



Direction 

Forward 



Reverse 



Reverse 



524 



Reverse 



Forward 



474 (Forward 



286 'Reverse 



572 [Forward 



502 



720 



Forward 



Forward 



488 



394 



Forward 



Reverse 



543 



195 



Forward 



Reverse 



544 i Reverse 



503 I Forward 



644 [Reverse 



587 



631 



Reverse 



Reverse 



757 j Reverse 



797 



Forward 



1502 I Forward 
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353. I 

J 


PSEGLI NCBI gi: 499660 - Pseudomonase 
DUtida. 


1 


-CAT * 


IT A- , 


50 


481 1 


Reverse 


354. 


Unknown 


I 


CTG 


TAA 


1 


522 


Forward 


355. 


Thioredoxin. - Strepiomvces Clavuhserus. 


I 


ATG 


TAG 


58 


375 


Forward 


356. 


D-alaninc permease (dagA) homotog - 
Haemophilus influenzae (strain Rd KW20) 


1 


ATG 


TAG 


185 


343 


Forward 


357. 


Unknown 


1 


-CAT 


TTA- 


78 


371 


Reverse 


358. 


"DNA Polymerase III 


1 


-CAA 


TCA- 


82 


561 


Reverse 


359. 


Transport ATP-Binding Protein Coma. - 
Streptococcus Pneumoniae. 


1 


-CAT 


TTA- 


112 


552 


Reverse 


360. 


Unknown 


2 


ATG 


TAA 


445 


540 


Forward 


361. 


surface protein PspA - Streptococcus 
pneumoniae 


1 


-CAT 


TCA- 


45 


416 


Reverse 


362. 


Licid Protein. - Haemophilus Influenzae 


2 


ATG 


TAA 


268 


633 


Forward 


363 


Unknown 


2 


ATG 


TAA 


527 


640 


Forward 


364. 


Glutamtne Transport ATP- Binding Protein 
GLNQ- - Escherichia Coli. 


1 


CTG 


TGA 


1 


393 


Forward 


365. 


Unknown 


1 


Iatg 


TAG 


184 


303 


Forward 


366. 


Unknown 


1 


ATG 


TGA 


794 


919 


Forward 


367. 


Lipoamide Dehydrogenase Component (E3) 
of Pyruvate Dehydrogenase Complex (EC 1 
8.1.4) (Dihydrolipoamide Dehydrogenase). - 
Azotobacter Vinelandii. 


i 


ATG 


TAA 


3 


416 


Forward 
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influenzae (strain Rd KW20) 














394. 


Unknown 


2 


ATG 


TGA 


319 


546 


Forward 


395. 


dihydrolipoamide dehydrogenase (EC 
1.8. 1.4.) - Pelobacter carfoinoiicus 


I 


ATG 


TGA 


3 


284 


Forward 


396. 


Unknown 


2 


ATG 


TGA 


241 


450 


Forward 


397. 


Beta-Glucosidase A (EC 
3.2.1.21XGentiobiaseMCellobiase)(Beta-D- 
Glucoside Glucohydrolase). - Clostridium 
Thermocelhim. 


1 


ATG 


TAA 


184 


453 


Forward 


398. 


Uracil Permease. - Bacillus Caldolvticus. 


1 


ATG 


TAA 


93 


353 


Forward 


399. 


Unknown 


2 


ATG 


TAG 


127 


516 


Forward 


400. 


ligoendopewidase F- Lactococcus lactis 


1 


ATG 


TGA 


134 


310 


Forward 


401. 


Stratpasea NCBI gi: 153565NCB1 gi: 4. 
Streptococcus Faecaiis DNA. 


2 


ATG 


TAA 


392 


568 


Forward 


402. 


Unknown 


2 


ATG 


Itaa 


376 


507 


Forward 


403. 


Exodeoxyribonuc lease Small Subunit (EC 
3. 1. 1 1 .6) (Exonuclease VII Small Subunit). - 
Escherichia Colt. 


2 


-CAT 


TCA- 


470 


682 


Reverse 


404. 


ATP-Depenent DNA Helicase RECG (EC 
3.6. 1.-V - Escherichia Coli. 


1 


ATG 


TGA 


3 


455 


Forward 
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Subtilis. 




1 






433. 


rri\j/*r\c\j I f r^i ncfpM flofD\ hnmnlna - 

Haemophilus influenzae (strain Rd KW20) 


1 

* 


ATG 


TGA 


127 


399 


Forward 


434. 


Unknown 


2 


-CAT 


CTA- 


244 


435 


Reverse 


435. 


Unknown 


2 


ATG 


TGA 


118 


309 


Forward 


436. 


Factor essential for Expression of 
Methicillin Resistance. - Staphylococcus 
Aur Eus. 


2 


-CAT 


TTA- 


168 


434 


Reverse 


437. 


Unknown 


1 


-CAT 


TCA- 


2 


214 


Reverse 


438. 


Unknown 


1 


TTG 


TAA 


2 


142 


Forward 


439. 


Unknown 


1 


ATG 


TAA 


30 


191 


Forward 
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SEQ ID NO: 



440. 



Identity 

Aspartate Aminotransferase (EC 2.6.1.1) 
(Transaminase A)(ASPAT). - Bacillus SP. 
(STRAIN YM-2Y 
Unknown 



ORF# 



Codon 
Start 



-CAT 



ATG 



Stop 



TCA- 



TAA 



Positio n 
SUrt 



63 



52 



Stop 



416 



342 
455 



Direction 



Reverse 



442. 
443. 



447. 



448. 



449. 



450. 



451. 



452. 



453. 



454. 



455. 



456. 



Unknown 



Unknown 

ATP-Dependent DNA Helicase RECG 
(EC 3.6.1 .-) - Escherichia Coli. 



-CAT 



TCA- 



210 



TTG 
ATG 



TGA 
TGA 



Unknown 
Cell Division Protein FTSA. 
Subtilis. 



Bacillus 



Unknown 



Unknown 



Unknown 



Unknown 



Unknown 



Unknown 



Unknown 



Unknown 



JAG Protein (SPOIIU Associated Protein). 
Bacillus Subtilis. 



DNA -3-Methyiademne Glycosidase I (EC 

3.2.2.20)(3-Methyladenine-DNA GIvcosylas 
EI 



ATG 
TTG 



TAA 
TAA 



ATG 



ATG 



CTG 



TAA 



ATG 



ATG 



ATG 



TAG 



TAG 



TAA 



TGA 



ATG 



ATG 



ATG 



ATG 



TGA 



2_ 
83 



29 

3 



70 



120 



31 



TGA 



TGA 



TGA 



TAG 



225 



290 



93 



91 



Reverse 



517 
376 



Forward 
Forward 



384 
371 



441 



Forward 
Forward 



Forward 



454 1 Forward 



159 



347 



423 



416 



418 



269 



365 



282 



Forward 



Forward 



Forward 



Forward 



Forward 



Forward 



Forward 



Forward 
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468. 


Unknown 


1 


ATG 


TGA 


385 


492 


Forward 


469. 


Unknown 


2 


ATG 


TAA 


587 


721 


Forward 


470. 


gaiE protein - Neisseria meningitidis 


1 


-CAT 


TCA- 


23 


460 


Reverse 


471. 


Unknown 


2 


-CAG 


TTA- 


717 


1319 


Reverse 


472. 


Naphthoate Synthase (EC 4. i. 3.36) 
(Dihydroxynaphthoic Acid 
Synthetase KDhna Sy Nthetase). - 
Escherichia Coli. 


I 


ATG 


TGA 


97 


414 


Forward 
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Multiple Sugar-Binding Transport ATP 
Binding Protein MSMK. - Streptococcus 
MUTANS. 



45 
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Table 2 shows the correlation between the SEQ ID NO of each DNA sequence of 
the invention with the SEQ ID NO(S) of polypeptide or polypeptides that its open reading 
frame(s) encodes. For example, the DNA of SEQ ID NO: 1 encodes one polypeptide, that 
of SEQ ID NO:224. Whereas, the DNA of SEQ ID NO:2 encodes two polypeptides, the 
polypeptides of SEQ ID NO.225 and SEQ ID NO:226. 



TABLE 2 



DNA 


Protein (open reading frame) 




1 


224 


2 


225,226 


3 


227 


4 


228,229,230 


5 


231 


6 


232 


7 


233 


8 


234 


9 


235 


10 


236 


11 


237 


12 


238 


13 


239 


14 


240 


15 


241 


16 


242 


17 


243,244 


18 


245 


19 


246 • 


20 


247,248 


21 


249.250 


22 


251 


23 


252 


24 


253,254 


25 


255,256 


26 


257 


27 


258 


28 


259 


29 


260,261 


30 


262 
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1 1 

3 1 


263 




32 


264 




33 


265 




34 


266 




35 


267 




36 


268,269 




37 


270 




3o 


271 




39 


272 




40 


273 




A t 

41 


274 




42 


275 




43 


276 




44 


277 




45 


278 




46 


279 




47 


280 




A O 

48 


281 




49 


282 




5U 


283 




5 1 


284 




52 


285,286 




53 


287 




C A 

54 


288,289 




55 


290 




JO 


291 




57 


292 




58 


293 




59 


294,295 




60 296,297 " 


61 


298,299 




62 300 


63 301 " 


64 


302,303 




65 


304,305 




66 


306 




67 


307 




68 


308 




69 309 


70 


310 




71 311 


72 312,313 ' 1 
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73 


314 


74 


315,316,317 


75 


318 


76 


319.320 


77 


321 H 


78 


322,323 


79 


324.325,326,327 


80 


328 


81 


329 


82 


330 


83 


331 


84 


332 


85 


333,334 


86 


335 


87 


336 


88 


337 


89 


338 


90 


339 


91 


340 


92 


341 


93 


342 


94 


343 


95 


344,345 


96 


346 


97 


347 


98 


348 


99 


349 


100 


350 


101 


351 


102 


352 


103 


353 


104 


354 


105 


355 


106 


356 


107 


357 


108 


358 1 


109 


359 


110 


360 


111 


361 


112 


362 


113 


363 


114 


364 
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157 


407 


1 58 


408 


159 


409 410 


160 

1 UV 


41 1 

"Til 


lOI 




162 


415 


165 


414 


164 


415 


16^ 


416 


166 


417 
*ti 1 


167 


41ft 


16ft 


41Q 

17 


16Q 


420 


170 


421 


171 
i / 1 


422 


177 


425 


175 


424 


1 74 




17^ 


476 


176 


427 


177 


428 


17ft 


429 


17Q 


450 


1 80 


451 


1 ft 1 


452 


1 R9 


45^ 


185 


454 




455 


185 


456 


186 


457 


187 


438 


1 88 


459 


1 o~ 


440 


1Q0 


441 


191 


442 


192 


443 


195 


444 


194 


445 


195 


446 


196 


447 


197 


448 


198 


449 
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199 


450 






200 


451 






201 


452 






202 


453 






203 


454 






ZU4 


455 






one 
ZOj 


AC ^ 

456 






ZOo 


A CJ 

457 






207 


458 






208 


459 






209 


460 






2IU 


461 






211 


462,463 






212 


464 






213 


465 






214 466 • ■ ~ 


215 


467 






216 


468,469 






217 


470,471 






218 


472 






219 


473 






220 


474 






221 


475 






222 


476 






223 


477 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

<i) APPLICANT: Black, Michael 
Hodgson, John 
Knowles , David 
Nicholas, Richard 
Stodola, Robert 

(ii) TITLE OF THE INVENTION: Novel Compounds 

(iii) NUMBER OF SEQUENCES: 477 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SmithKline Beecham Corporation 

(B) STREET: 709 Swede land Road 

(C) CITY: King of Prussia 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19406-0939 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE; Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

<vi) CURRENT APPLICATION DATA : 

(A) APPLICATION NUMBER: 

(B) FILING DATE : 01-APR-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/014690 

(B) FILING DATE: 02-APR-1996 

PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/025788 

(B) FILING DATE: 22-AUG-1996 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Ginuni , Edward R 

(B) REGISTRATION NUMBER: 38,891 
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(C) REFERENCE/ DOCKET NUMBER: P5 046 6 



PCT/US97/053O6 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-270-4478 

(B) TELEFAX: 610-270-5090 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION : SEQ ID NO : 1 



TAGATAGCTT CCGAACTTAC GAG 

(2) INFORMATION FOR SEQ ID NO: 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



SEQ ID NO: 


1: 






CGCAATCGGA 


TCCAAGGCTG 


AGGCTGGGCT 


60 


AGATGGCACG 


AGCGATACAG 


AGACGTTGTT 


120 


TGTGGAGATC 


GTCTTTAACC 


TGATCCCAAA 


180 


TTTCATCTAG 


GACTTGCTTA 


TCCTTAACTC 


240 


GGTAAATTGA 


CTTAGCAAAT 


GGATTGGGGC 


300 


TTTCATAAAC 


GTTGATTTCT 


GGACGGTTGA 


360 


TTACTTTAGC 


AATATCAATG 


GTATCATTCA 


420 


TCCCCGATCC 


CGACGGACCA 


ATCAAAGCTG 


480 


ATCCCCTTAA 


TGGATTCATT 


TTTACCATAG 


540 


ACTTTTTCTT 


CAGGAAAGGT 


AAGGATATGC 


600 


TCTCCTTTAG 


GCAGCGGTTA 


ATTTCTTGTG 


660 








683 



(Xi) SEQUENCE DESCRIPTION: SEQ ID MO:2: 
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CTAGTCCAAC 


TAACTGAGTT 


TTCCTTTATO 


TATTATATCA 


AATATAAGTC 


CGTTTGTAAC 


60 


TAGTGAAGAA 


TTCTTTTGTC 


CGCTCTTCTT 


TAGGGGTGTG 


GATAATCTCA 


TCCGGAGTTC 


120 


CAGACTCGAT 


GATTTTCCCC 


TTATCTAAGA 


AGAGAATTTT 


ATCCGCAACT 


TGGGCTACAA 


180 


AGGGCATGTC 


ATGACTGACC 


AAAATCATGG 


TCTGACCTGA 


CTTAGCAGCA 


TCTGCAATAG 


240 


ACTTTTCTAC 


TTCACCGACC 


AATTC TGGGT 


CAAGGGCTGA 


AGTTGGTTCG 


TCTAAGAGCA 


300 


AAACATCTGG 


TTTCATAGCA 


AGCGCACGCG 


CTAGGGCAAC 


CCGTTGCTTC 


TGTCCACCTG 


360 


ATAAATGGCG 


AGGATAATGG 


TTTTCACGGT 


CCGAAAGCCC 


AACCTTAGCC 


AACTCTTCCT 


420 


TGGCAATCTT 


AGTCGCTTCT 


TGGTCAGATA 


ATTTCTTGAC 


AACAACCAAG 


CCTTCTTTCA 


480 


CATTATCAAG 


TGCTGTTCGG 


CGTTCAAATA 


AATTAAACTG 


TTGGAAAACC 


ATAGACAACT 


540 


TACGGCGTAG 


GGCAAGGATT 


TCTTCTTGAG 


TGATTTTAGA 


AAAATCAACT 


GAAAAACCAT 


600 


CAATCTGAAT 


AGAGCCACTG 


TCAGGTGTTT 


CAAGATAATT 


GAGACTGCGA 


AGAAAGGTTG 


660 


ATTTTCCAGC 


TCCTGAAGAA 


CCAATCAAGG 


CTACAACTTC 


CCCTTTTTGA 


ATATC C AAGT 


720 


TCAGATGATC 


CAAGACAGTC 


TGTCCTGAAA. 


AGGATTTGCT 


TAAATTCGAA 


ATCTTAATCA 


780 


TTAACGAAGG 


TCTCCTTTCA 


CATCTGTTTG 


CACTGTATCA 


GGTGCAGAAA 


TAGCCATTTT 


840 


TCTCTCGATG 


AAACGACCGA 


GGCTTTCAAT 


TCCGATATTG 


ACTACCCAAT 


AAACAAGGGC 


900 


AACAGAGATG 


AAGCGTTCAA 


AATAGCGATA 


ATCAGCTCCA 


CCTAGAATCT 


GAGCTTGGGC 


960 


AAAGACTTCC 


ACAACACCCG 


CACTAAAAGC 


TAGAGATGTT 


CCCTTGGTCA 


AACCGATGAG 


1020 


GGAATTAATC 


AAGGTTGGAG 


TAGCTACCAC 


CGCTGCATTA 


GGAATAATCA 


CTCGTCGATA 


1080 


AACTTGCGCT 


CGGGTCATAC 


CCAGACTGCG 


CGCCGCCTCA 


ATCTCACCAG 


GATTAACTGA 


1140 


GAGAATGGCT 


GCACGAATGG 


TTTCACTAGC 


ATAAGCTGCC 


TCATTAAAGG 


CAAAAGCGAC 


1200 


AATCGCAAAA 


GCTGCAGCTG 


GAATCGCATT 


GATATTGAGA 


CCAGTTCCCC 


ATTGCTGATT 


1260 


GAGGGCTTTC 


AAAGCCAAAG 


GGATTCCGTA 


GTAGGTCAAC 


ATGAGTTGCA 


CCAAAATCGG 


1320 


TGTCC CTTTT 


AAGAAACTAA 


CAAAGAAGGC 


CTGCAAGGGA 


TATAAAATCT 


TG AC AC G ATT 


1380 


GATCTTCACA 


ATGGCAAAAA 


GAAGCGCCAA 


AACCAAGCCA 


AAAAGGGCAC 


CGCCAATTGT 


144C 


CAACATAATT 


GTTGTTGGAA 


GTTGTTGGAC 


AATTCTAGGG 


ATTCCATCAA 


AGACCGAACG 


1500 


TAGGCTAAAC 


AG 










1512 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TO POLOGY : 1 ine ar 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CTGGAGATTA TTGAGATTGC CCGTCAAAAC GATTTGATTA TNTTTGCGGA TGAAATCTAT 60 

GACCGCATGG TAAATGGACG GACATGTGCA TAACGCCTGT GGCGAGCTTG GCACCAGATG 120 

TCTTCTGTGT CAGCATGAAT GGTCTGTCAA AATCCCACCG CATAGCAGGT TTCCCGTGTG 180 

GGATGGATGG TCTTGTCTGG CCCTAAGACT - CATGTTAAGG GCTATATCGA AGGGCTCAAT 2 40 
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ATGCTGTCCA ATATGCGCCT TTGCTCTAAC GTTTTGGCTA CCAACGCTGC ATTAGGAATA 3 00 

ATCACTGTCG TACAAACTTG CGC TTGGGGG GTCACCAATC AGTCGATGAA TTGCTTCTTC 3 60 

CTGGTGGACG AATCTACGAG CAAAGAAATT TCATCTATAA TGCCATTCAA GATATTCCAG 420 
GTTTGTCTGC CGTTAAACCC AAGGCGGGGC TCTATATCTT CCCAAAAATC GACCGCAATA 
TGTACCGTAT CGATGATGAT GAGCAGTTTG TCCTTGATTT CTTGAAGCAG GAAAAGGTTC 
TCTTGGTTCA TGGTCGAGGC TTTAACTGGC AGGAACCAGA CCACTTCCGT ATCGTTTACC 
TTCCTCGTGT TGATGAGTTA GCCCAAATCC AAGAAAAGAT GACTCGTTTC TTGAAACAGT 
ATCGTAGATA GGGCTTGCAT TCGAAAAAGC TGGAAACATT TGCCTAGAG 



480 
540 
600 
660 
709 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1588 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

CTGCTTCAGA ACCTGCCAAA CCAGCACCGA TAACATTGAT ATAAGATTGA GACACGACAC 
TAATACCTCT TTGGGAGTGT GAAGTTAAGA TTCACATTGA AAAAGCCAAT CAGACTTACA 
AGCTTTCGAG TTTCTTGGCT CAGGCTGAAA AAGTCCACAG GGCTTTTNCA CTCCCACAAA 
TCTTTCTATT TTTTCTNCTA CTAGTATAAC AAAAAAAGGG AAGAAGGNAA ACTTCCCTGT 2« 
TTAGTCATTT TCTTGATGTA AAGAGATAGT GAGTATTCCA GTTAAGAATC AATTATATGC 300 
TACTCTATAA AATCTTTTCC ACATAACGGA TCGATAGGGA CTGTTATTCT ATCTCTTGCT 
ATAAC CGTAT TATCTAAAAC AGCATAACAT TCAACATAGT GATCTCCTTT AAACTGTGAA 
TCTTCCGTGA TATTTTATTT TACCTGAAAA AATAGCACAC GCTCACAATT CTTCCTAATA 
GCCTCAGCTT CAATATTTCT TACTTTCCAA TAGACTCCCT GCGAAACAAA ATATGGTATA 
GTAGTTCTAT GAATGATGAA GCAAGTAAAC AACTAACTGA TGCACGATTT AAGCGTCTTG 
TTGGTGTTCA GCGCACGACT TTTGAAGAGA TATTAGCTGT ATTAAAAACA GCTTATCAAC 
TTAAACACGC AAAAGGTGGA CGAAAACCTA AATTAAGCCT AGAAGACCTT CTTATGGCCA 
CTCTTCAATA TGTGCGAGAA TATCGAACTT ATGAACAAAT TGCGGCTGAT TTTGGTATCC 
ACGAAAGCAA CTTAATCCGT CGGAGCCAAT GGGTTGAAGT AACTCTTGTT CAAAGTGGTG 
TTACGATTTC AAGAACTCCT CTCAGTTCTG AGGACACGGT AATGATTGAT GCGACGGAAG 
TACAAATCAA TCGCCCTAAA AAAAGAATTA GCGAATCATT CTGGTAAAAA GAAATTTCAC 
GCTATGAAGG CTCAAGCGAT TGTCACAAGT CAAGGGAGAA TTGTTTCTTT GGATATCGCT 
GTGAACTATA GTCATGATAT GAAGTTGTTC AAAATGAGTT GCAGAAATAT CGGACAAGCT 
GGAAAAATCT TGGCTGATAG TGGTTATCAA GGGC CCATGA AGATATATCC TCAAGCACAA 
ACTCCACGTA AATCCAGCAA ACTCAAGCCG CTAATAGCTG AAGATAAAGC TTATAACCAT 
GCGCTATCCA AGGAGAGAAG CAAGGTTGAG AACATCTTTG CCAAAGTAAA AACGTTTAAA 
ATGTTTTCAA CAACCTATCG AAATCATCGT AAACGCTTCG GATTACGAAT GAATTTGATT 
GCTGGCATTA TCAATTATGA ACTAGGATTC TAGTTTTGCA GGAAGTCTAT TATTTTCCTT 
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ATTGTCTGTA AGTCTACTGA CCTTGTTGTT TATCCCAGTC ATGGTTTCTA GTTCGGGCTC 14 40 

AGAGTTTCAA AGTGGATGGC AAGAGCATCA ATTGATTGCT GAGAAGGTTA GTAAAACACT L500 

TGACAAGACA TTTGATAAGG ATGTCAGAAA AATTCCGACC AGTCCAGTTT TATCAAAAAT 15 60 

TTGTAGATGA GATGGGAAGG ATTTACTC 1588 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



CTGTTTTTAA 


TACAGCTAAC 


ATCTCTTCAA 


AAGTGGTACG 


CTGAACACCA 


ACAAGACGCT 


60 


TAAATCGTGC 


ATCAGTTAGT 


TGTTTACTTG 


CTTCATCATT 


CATAGAACTA 


CTATACCATA 


120 


TTTTGTTTCG 


CAGGGAGTCT 


AATATTGTCA 


AATACTGGAG 


CGCTCATTGC 


TGGTATACGG 


180 


AATAAGATTG 


GCCCAGCTTC 


GATAACTGGG 


AT AC CTGGTT 


CAAAACCAAG 


ATCTGTTGCA 


240 


GCGATTGGTG 


TAAAGATATC 


GTAACCTTTC 


ATAAGGTCTT 


CGTTTACATC 


TTTCACCATG 


300 


ACTGCATCAC 


AGTGAACATC 


ATAACCACGG 


TTTGAAAGTT 


CTTCTTCTAG 


AGCACTTTTA 


360 


ATTTGGTGAC 


TTGAGTTAAC 


ACCTGCACCG 


CAGGCAGCAA 


GAATTTTAAT 


CATTTAGATT 


420 


TCCTCCGATT 


TTATTTTTTA 


ATAGACAAGA 


TTAAGCGGTT 


GCTTCAGCAA 


TGTAAGCATA 


480 


AAGTTTTTCT 


GGTTCGGAAA 


TTTTTGATAG 


GTCTTCAAGA 


TGTCCATTTC 


CTGTGAAAAA 


540 


GTCCATCAAC 


TGAGCC AG AA 


TATTTGTTTG 


ACTTGAACTT 


GAGTTATTGA 


TGATAAAGAA 


600 


GAGCAAGGAT 


ACTTCTACTT 


CCTTATCAGG 


AGCTATCATA 


TTGTGAAAAG 


TTACTGATTT 


660 


TTCTAATCGA 


ACAACCACCA 


CTTTTCTCAG 








690 



(2) INFORMATION FOR SEQ ID NO : 6 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 : 



ACTGCTACCT TTAAGAAGAT AGTAGACGTA TATACTTTTT TAAGAAAATC AAAAAGATAC 
TATAAAAAAT CTATTTGTTT ATTGAATTTA AGACTTTGGT AACAAATTGA AAATAAAAAG 
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GAGGTATTCA TCATGAATAC AAAAATGATG TCACAATTTT CTGTTATGGA TAATGAAATG 
CTTGCTTGCG TTGAAGGTGG AGATATTGAT TGGGGAAGAG AAATTAGTTG TGCAGCAGGG 
GTTGCATATG GCGCAATTGA TGGGTGTGCA ACAACGGTTT GATATTTCTA TTGGGACCAT 
TTGCTATAGG AATAGGTGTA ACTGGTGCTG CAGGTGGAG 

( 2 ) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CTGAAGACCT CGTGCTACTC CTCCTCCAAC ACCTGCTTTG GCAAAATCTC CCCAATTGCA 
TCCGCCACCT TCAACTCAAG CAAGCATCTC AGTATCCATA ATTTCAAATT GTGACATCTT 
IJ™ ATTC ATAACGAATA ^TCTTTTTT ATTTTTAATA TTTGTCTTGT TACAAACTTG 
ACAAGTTTAG TATAACAGTA TCTATTAATT TTTTTCATCC AAATCTTGAA TTGGCATCGA 
AACGTCTTGA ATTAGCTTTT TTGTTTCAAA ATCATCTCTA TTTTTAAAAA AGATGTTTTC 
TAATCACrTT TTTACTATTT AGACTTCCTG CAAAACTAGA ATCCTAGTTC ATGATTGATA 
ATACCAGCAA TCAAATTCAT TCGTAATCCG AAGCGTTTAC GATGATTTCG ATAGGTTGTT 
GAAAACATTT TAAACGTTTT TACTTTGGCA AAGATATTCT CAACCTTGCT TCTCTCCTTA 
GATAGCGCAT GATTACAGGC TTTATCTTCA ACTGTTAGCG GCTTGAGTTT GCTGGATTTA 
CGTGGAAGTT TGTGCTTGAG GATATATCTT CATGAGCCCT TGATAACCAC TGTCAGCCAA 
GATTTTACCA GCTTGTCCGA TATTTCTCAT TTCTAAAAAC CATCTACTTC CGTTGACTTG 
GAACTCCGAT TGCTATTTTC CTTGAATGAT TTTAACCACA TCTCCTACAC TTTGGAGTTG 
GTCAATTTCC TCATCGCTGA TTTCGATACT AAATTCATCC TCCAGCGTCA AGATAAACTC 
CATCAAATCA ACTGAGTCAG CATCCAAGTC GTCTTTCAGA CTCAAGGATT CTGTC^oIc 
AAAGTCCTCT CCCTGTCGCT C TTGGAT AAT GGTCACAATA CTGTCAAAAA TTTCt£££ 
TCTCATCTCT TTTATTCTCC TGAAAATTCA CGCGCAGTCT GGGCAACTAC ^^CT 
AGCATGGTAC GAATCTGGCG AATCGTACTA TAAACAGCCT TGGCATCGCT Z^IZH 
GTCT^GACAA CAGGTGCCTT GACACCAAAC AAGACCGCTC CACCAACATC ^T^TTG 
AGCTG.TTTT TCAAACCTCT GAGGCTGTCC TTGAGAAGGA GGGCACCTAG TTTCGCTCGA 
AGACCACCAC CTGTAATAGC TGTCTTGAGC AAGCCCATGA TTC CC ATAGC TGTCCCTTCG 
ATGGATTTGA GCACAGCGTT TCCCGTGAAA CCATCTGCCA CAACAACATC TGCAACGCCA 
ZTZT^' CACGCGCTTC ^GTTTCCG ATAAAGTTCA AACTTTCATC AGCCGCCAGT 
AATTCATAAG TTTCCTTACG AAGCGGGTCG CCCTTGCTAC TCTCIGTTCC G^TCTTGAGC 
AAACC^CAC GTGGTTGCGC AATGCCACGA ACATTCTTGG CATAGAAAGA AC^ACA 
GCGTATTGAT GGAGGTGCTG GGCTGTATTT TCTGCATTAG CACCGAGGTC AAGCATGTCA 
AAACCTrrCC CATCTACAGT CGGCAATGTT GACATAAGTC CAGGACGGTC GATATTCTTG 
ATACGACCCA CGATGAAGAA TCCAGCAGCC AACAAAOCAC CTGTATTCCC AGC^gIaIgG 



180 
240 
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ACAGCGTCTG CTTCACCATC TTTGACAGCC TTGGCTGCCA ATACCATACT GGCATTTTTC 
TTATTCCGAA TAG 



1680 
1693 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 616 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



CTAAATCTGC 


TTGCTTAGTC 


CACTTGCTTG 


AGC CAAGGAG 


TCACATACTC 


AAAATTTTCA 


60 


TGAAAGTCAT 


AGGT AC CGTC 


TTCTTTTTTA 


GCTGAAAAGA 


AAAGTCCATC 


GTGGTAGAGC 


120 


AAATCCCCGT 


GGGCCATAAT 


TCTGGCAGTT 


TTTTCCTCTT 


CCTACTCCTG 


AGACTTTTGC 


180 


TTAGTCCCCT 


CTTGAGAAAT 


AGTATCTCGT 


TTTTGAC TAG 


TCAAGGGATT 


CCTTGGAAGC 


240 


TTTCAAACAA 


CAAGACCAAG 


CCCATTGATA 


AACCAACTGC 


TAGCAAGAGT 


ATCGCCACAA 


300 


ATCCCTTATT 


GCTCCACTTG 


CGATAACTCC 


TAAAAAGTTT 


ACCAAGCCCT 


TCATAAAACG 


360 


AAAAGCTAAA 


CCACCCTGAT 


TTCGATTTTG 


TCTTCTTTGT 


ATCTTCGTTC 


TCCCTACTTT 


420 


CTTATGCAAG 


CCTTTTCTTT 


TTATTATATC 


ACAGATAAGT 


ATTTCTTTCA 


CAATTGAATT 


480 


GAACTTCCCA 


TCTATTTTCT 


ATAAATCCTA 


AATGC C AT AA 


TGCTTTCAAT 


TCCTGTCATT 


540 


TTGTGATATC 


ATGTAGAAGA 


AATGAACTAA 


TCCACAGTGG 


CTTATTCCAA 


GTATACCACT 


600 


TGGGCTTTGG 


CAGTAG 










616 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1973 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



CTAATATAGA 


ATAATCACCG 


CCGTTGTGAA 


AGAACGATTG 


GATGATAATC 


CAATCGTTCA 


60 


GGGAAATTGG 


AAGACCTTGG 


GTTTCCAATT 


TAGGCATGAG 


ACACCTTTGG 


TGGCTGCTGC 


120 


CGTCCCTCAC 


AAGCTAAGGT 


GATTGTTGAA 


AAAGAGGAAA 


AAGGAGAAGA 


AATGAAACCA 


180 


GTAATTTCCA 


TCATCATGGG 


CTCAAAATCC 


GACTGGGCAA 


CCATGCAAAA 


AACAGCAGAA 


240 


GTCCTAGACC 


GCTTCGGTGT 


AGCCTACGAA 


AAGAAAGTTG 


TTTCCGCACA 


CCGTACACCA 


300 
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? CATCATCGCA 


360 


: CCTTCCAGTC 


420 


k TTCTATCGTT 


480 


i TTTTAGGATA 


540 


' GTAATGTTTT 


600 


' AAGGCATGCT 


660 


CATCCTTCCA 


720 


. AGCTTGTAAT 


780 


CGCCCCTAAA 


840 


CAAGTCATTG 


900 


ATTACACGAG 


960 


TTAGATCCTT 


1020 


TTCCAAGTAT 


1080 


GAGTAAAATG 


1140 


TGTAGTTGTC 


1200 


TTCTGGACAA 


1260 


CAGAACTTGT 


1320 


GGTTC TATTT 


1380 


ATCGACCAAC 


1440 


CTTTGACCGT 


1500 


CCTCCAAGCC 


1560 


TCTGATTCTT 


1620 


CCCATTGGCT 


1680 


CCAGCAGAAA 


1740 


AGCCGTGATT 


1800 


GCGCGTGCGC 


1860 


GAATGGGATG 


1920 


GAA 


1973 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CTAAACCAAT AGTTGCTGAA GACCATAATC CAGCTGCTGT TGTTAAACCT TTAACTTTTT 
GATTAGTTAC CATGATTGTT CCTGCCCCTA GGAAACCCAA GCCACTAATT ACTTGGGCTC 
CCATTCGACT AGGATCACCG CTACCATAAC GACTAGTGAT GAACTGATTT GTCATCATTA 
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LnnLALnnvj X 


TrrrA a hp a a 


AC- 1 AG X AAvj x 


AGGTTC T AAT 


CCCTGCTGCT 


TGGTTTTTGA 


240 






AITjCCACCGA 


AAAGAATCGC 


TAAAAAGCAC 


CTAAGAAGTA 


300 


TTTPrra a at 

XXX UV^L/Vh/ii 




1 A I G AAAGAT 


TCATATTATC 


TCTTACCTCG 


TTTACCTTGG 


360 


n/i 1 nuviC 1 lb 


AT , AAA r PAR A 
A 1 AAA X MAAb 


AGC rGCACTA 


GACATAATCA 


TTAAAATTAC 


AGAATAAACA 


420 


AAV-A 1 V.. A 1 ILj 




I AAAGTTGCT 


GTTTCATCAG 


TAGACTGTTT 


AATAACGATT 


480 




pp. A A A AP'T'P.P 


A 1 GG 1 AC AAG 


AATACAGATA 


AGTCATAGTC 


AGATAATAAA 


540 


GAATTAAAGT 


TTAArAPAAr 


T A CVCZ. A T A A T 
X AL X on X AA 1 - 


AC AC TGGTA 


AAATATACGG 


TATAATAACT 


600 


CC3CACCATAG 


Tt"5TA<T?AAAPT 

X Vj J. /T.VjTXTLfA-iV* X 


AUA X VjV- AV—V_ C 


A I ACT ACGTG 


CAGCTTCTTC 


CATATCATTA 


660 






V-.UVjAA1V_.AX X 


CTATAAGAAA 


ATGGTAATTT 


TTGAATAGTA 


720 


TATf^rAATAA 
A .rv x ovnA x nA 


HTAAAATIAT 
ys X AnAn X HA X 


X AC l\j 1 Xv_C 1 


ACT AAAAC T A 


AATTAAATAG 


TATTAAATGA 


780 


f^HTATATTAT 


A APHTP. A AP AT 
AAu IvjAALA X 


lAA-LLiAGT 


GCAATCAATG 


TACCTGGCAA 


TATCCATGGT 


840 


AX IvjLAL 


f^TAPTr'A A A 


bAATTTATCG 


AACTTACTCT 


TGTTTTTATG 


TACAATACGT 


900' 


pa aaTTirTa 


1 X vjC 1A1 AA X 


TG T AGC AATT 


ACCGCAGCTA 


AAATTGCATA 


AACAACGCTG 


960 


AC x ACiCj I AAb 


Cj AC G AAATG A 


TTGAGCATCT 


GTAAATAAAT 


TAGCATAGTT 


CGCTAACGTA 


1020 


AATTTAGATA 


GATCTAAAGT 


TCCCGTCTTG 


ATCGTCAACG 


AGTCTGTAAA 


CGAGTATAGA 


1080 


ATTATCAAAA 


CTATTGGTAA 


CATATAGATT 


GCAAACAATA 


CATACGCAGC 


AATGTGAGCA 


1140 


AGAATTATTC 


CATAATGGAG 


AAGAAAATTA 


TCTGCTTCCT 


TAAT AC TAG C 


CTGGTTTTCG 


120C 


AAACAGAAT 












1209 



<2) INFORMATION FOR SEQ ID NO : 11 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 1 : 



CTAAGTCCTT 


TTAGTTTTAT 


CTTAATTCTC 


TTATTGTTGT 


AATAATCAAT 


ATAGTCTATA 


60 


ATGGCTTGTT 


CCAATTGCTT 


AAGCGACTGA 


AACGACTTCT 


CATAACCGTA 


AAACATTTCC 


120 


GATTTCAGAA 


TCCCAAAGAA 


AGATTCCATC 


ATAC CGTTGT 


CTTGGCTGTT 


TCCCTTGCGT 


180 


GACATAGATG 


CTTGAATTCC 


CTTACTCTTA 


GGAACCGATG 


ATAAGAATCG 


TGTTGGTATT 


240 


GCCAGCCTTG 


GTCACTATGG 


AGAATCGTAT 


TCTCGTAGTG 


CTTCTCTGTG 


AATGCCTGTT 


300 


CCAACATTGT 


TTGTACTTGT 


TCTAAGTTGG 


GTGAAGTTGA 


AAGATTATAG 


GCGATAATTT 


360 


CGCTATTAAA 


GCCATCTAAA 


ACTGGTGATA 


AGTAAAGCTT 


TTGAGTACTT 


GCTGG AATGG 


420 


CAAATTCTGT 


CACATCTGTG 


TAGCACTTTT 


CCATTGTTTT 


AGAGCCTTCA 


AATTGGCCTT 


480 


GAATGAGATT 


CTCTGCCTTC 


TTACCAACGT 


CTCCTTTATG 


AGAAGATTTC 


GTTTCTGTCG 


540 


CATTTTAGCT 


TGTAAATTGA 


GTACTTTCAT 


CAAGCCTTGA 


ACTCTTTTAT 


GATTTACCAG 


600 


ATAAGCACGA 


TTCCTTAGTT 


CTAAATGAAT 


ACAGCGATAA 


GCATAATTTC 


CCTTGTGTTC 


660 


GATAAAAATG 


GATTGAATTT 


CAGCTTTAAG 


CTCTTGGTCC 


TTATCTGGTT 


TGTNTAG 


717 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 851 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s i ng 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

CTAAATCACC AGATTTAACC AAGTCGTAGA TACGGCCTGG TGTTCCAATA ACAATATGAG 
GCTGATTCCT TGCCAATTTC TCAATCTGGC GAGCCTTATC CGTACCACCC AC^^ 
CCACACGAAC TTCGACATCT GAGTGAGCTG AAATCTGACG CGCTACTTGG TAAATTTGAG 
l^ZT CTC ACGACTCGGT GCAGTAATCA CTGCTTGTAC ACTATCGCTA GC^T^A 
ATTGCTGGAA AATCGGTAAC AAGAAAGTAT GAGTCTTACC TGAACCTGTT TTtIaTTCT^ 
CTACTAGGTC ACGACCTGCC AAAACAATAG GAATCAACTT GTCTTGCACC ZIJ^H 
TTGTAAATTT TAACTCCTCC AAGGCTTCTC TAATATAGTT TTTAAATTGA AATTTCGTAA 
ATCACATAAC ATCCTCGATT CTATCTATCT TATCAATTAT ACCATA^TT TZ^Tc 

AA™^ CTATTTCCAG TAGCTOCT AGTAAGAAAA -™ 
TTATAGTTCC AACCTCTTTT CAGTTATTAT TTCCAGTTTA ACATAGCATT CAAGCCATAG 

AACTCTTTGG TAGTAAAGAC ™^ 
AACTCTTTGG TAGTAAAGAC ATAATCGATT CGAAGGGGTT CAGTGTTCCC TTTCCAGCCA 

TCAATTTCAG GCGGAACAGT ATAGCTACCA CTTTTCTCTT GAGCAACTTC IHScGtS 
"ll — GC TAAAATAGCT CCCTGACCTG T^ZZ 340 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 13; 
CTGTTGAGAT TGTTACGAAA TAACTGAAAA TATTTAAGGA GAAAATATAT GTTGAAAPAr 

= = ~ = ™ = 

GCAGGAAAAT ^CT^ GGATGCTTTA =C„CT ATC = 
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WO 

TTGCCAGTTG TTATTGCTAC AGGTTTGTCT 
CCAGGTTTTG TTGTTGGTCT AATTGCCAAT 
TTGGGAGGTT ATATAGTTGG TTT CTTGGTT 
AACTGGATTA AAGGTTTAAT GCCAACCTTG 
GTTTGATTAT GATTTATATT ATTGGGGCGC 
G 

(2) INFORMATION FOR SEQ 



PCT/US97/053O6 

TACTCGATTG CTGGTAAGCC AGGGATTGCA 300 

TCTGTTGGTT CAGGGTTTAT CGGTGGTATC 360 

CAAGCGATTA TTAAAAAGGT CAAAGTACCA 42 0 

ATTATTCCTT TTGTACCTCT TTGGTAAGTA 480 

CTATCGCAGC CTTTACCAAC TGGTTGACGA 54 0 

541 

ID NO: 14 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



CTGGAATGAG 


TGTATAAGCC 


CAGCCCAAGT 


TTTGCATCCG 


TTCAAAGTTC 


CAAGACCCTT 


60 


GTAAGAAGGT 


TGAACGCCAC 


CAAACTTTTT 


TACGATCTGA 


TTTAGTTAAT 


TGAAGTTTTT 


120 


CAGTCATGAT 


GTTTTCAGTC 


CTTTCTTATC 


TTAGTAGTCT 


TCTAGGATAT 


CGCCGATTGG 


180 


GTCGTTAGAA 


GTTGCGGCTC 


CTCCGCCACC 


ATTTCCACCA 


GTTTTAGAAA 


GGTGAAGGTA 


240 


GATAAGAGCG 


ATAGCAACAC 


CGATAGCACC 


GAATCCGATT 


AGAGTAATAT 


CTGACACAGC 


300 


AGCGAGAACG 


AAACCAAGAG 


CGAAGAATGG 


C C AT ACTTC A 


CGAGTTGCCA 


TCATGTTGAT 


360 


AACCATGGCG 


TAACCAACGG 


CAACG AC CAT 


ACCACCACCG 


AT AGC CAT AC 


CATCTTTGAG 


420 


CCAGTCTGGC 


ATGGCACTAA 


GGATACTTTG 


TACAGTTTCA 


GTTGGTACCA 


TAAGGAGAAG 


480 


AGCTGCAGGA 


AGCGCGATAC 


GAAGTCCTTG 


GAAAAGTAGC 


GCAATGAAAT 


GCGCACGCTC 


540 


CACAGCGCCG 


AAGTCACCTT 


TTTTAGCGGC 


AGCATCTGCA 


GTATGAACCA 


AAC C AACTG A 


600 


AATTGTACGA 


ACAATCATTG 


TCAAGAAAAG 


TCCAGCTACG 


GCAAGAGGGA 


TAGCAACCGC 


660 


TTGGGCAACA 


CCGATACCAG 


TCTTGGTAAA 


GTCACCACCA 


AGAACCATGA 


TAATGGCAGC 


720 


AGCGACAGAA 


GCAAGTGCAG 


CATCAGGAGC 


GATAGCAGCA 


CCGATATTTG 


ACCAACCAAG 


780 


GGCAATCATT 


TGAAGCGATC 


CACCGAGGAT 


AATCCCTGCT 


TCCAAGTGGA 


CCTGTTACAA 


840 


GCCCAATAAG 


GGTACAGGCT 


ACAAGTGGTT 


GGTGAAATTG 


GAACTGGTCG 


AGGATGCCTT 


900 


CAAGACCTGC 


AAAGGAAGGC 


TACAACGACT 


ACTAAAACCA 


TAGAAATAAT 


AGACATGTTT 


960 


AAAATCCTTT 


CATAAATAAT 


GGCTTATTTG 


ACATTGGCTT 


TGTTAATCAA 


GTCAAACAAA 


1020 


TCTTTTTTAG 


AATCATTTGG 


TACTTTACGG 


ACATCAAATT 


CAACACCCAA 


GTCACGCATT 


1080 


TTTTCAAATG 


TAGCAACATC 


TTCTTTGTCC 


ATAGACAAAA 


CGGTATTGAC 


CAATGTTTTA 


1140 


CCTGTTGAGT 


GAGCCATAGA 


ACCAACGTTA 


AGAGTCTTGA 


TTGGCACGCC 


GCCTTCGATG 


1200 


GCACGAAGGG 


CATCTTGAGG 


TGTTTCAAAC 


AAGATAAGGG 


CATGTGTTTC 


TCCAAAACGT 


1260 


GGGTCTTTTG 


AAATATCAAT 


CAGTTTTTGA 


ATTGG AAC C A 


CGTTAGCCTT 


G AC ATT ACC T 


1320 


GGAG 
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(2) INFORMATION FOR SEQ ID NO: 15: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii> MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TCATAAGGAA GCTGTCGCTC GTTCCGCTAA GGTATGGACA CCACGGTGAA CATTGGCATT 
GTCCTGCTCA TAGTAACTGT TAATAGCTTT CAGAACTACT AGTGGTTTTT GTGTCGTCGC 
AGCATTGTCC AGATAGACCA GAGGTTCATC ATTGACAATC TGATCTAAAA TTGGAAAATC 
CTTGCGAATC GCTTCTACAT CTAACATAGG CTTCCCCTTA GCGTTTTGAC AATTTCTCTT 
CGATAGTTGC AATCATTTCA TCACGAACTT CCTTGACTGG AATCTCCACG ATAACAGATC 
CAAGGAAACC ACGAACAACC AAACGCTCTG CAGTTGCCTT ATCCAATCCA CGACTCATGA 
GGTAATACAT GTCTTCTGGA TCAACTTGTC CGATAGACGC TGCGTGTCCT GCAGTGACAT 
CATTTTCATC AATCAAAAGA ATTGGGTTAG CATCTGAACG CGCTTGGTCT GAAAGCATGA 
GAACACGGCT CTCTTGTTGC GCATCTGCTC CCTTAGCACC CTTGATGATG TGGCCGATAC 
CATTGAAAGT CAAAGTTGCT TTTTCAAGGA TAACCCCATG TTGTAGGATA TTTCCGATAG 
AGTTGCAGCC ATAGTTAGTT ACACGAGTAT CAATCCCTTG TACCTGACGA CCACTTGAAA 
GAG 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CTAGTTGGAT GGC TTCAAT A AAGGATGATT TGGCTGCTCC ACTATTGGCA ATGAGCTGAA 
AACAGATATA TTCCATTTCT TCGTCATCTT ATTTCTCCTA TCCATTCAAG TGCTTGTTCC 
AGAACTTTTG CTCCATTCAT CATTCCGTAA TCCCGCATAT CAATGGTATC TACAGGGATA 
TTTCCTGCAA TTTCTTTCAC AGCAAGTAAC TCATAACGAA TTTGTGGCCC AATTAGAATG 
ACATCTGCTT CATGGATATT CTTTTTAGCT TCTGTCATTG ATTTTGCTTG GATAGAAATT 
TCAATCCCAC GTTCAGTCGC ACTTTGTTGC ATTTTTTTAA CAAGCATACT TGTCGACATT 
CCCGCATTAC ATACTAATAA AATTTGTTTC ATAATCTTAA CCTTCCATTT CT^TTCAAC 
AACTTTGTCA TTAACTTTGA TAAATGGAAT GTATAGAAGA ACTCCAAGTG CAAAGATGAT 

63 
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GAATTGAACT AGAACTGGTC TCACGTCCCC TGCTGTTGCT AACCATGCAT TTAAGAATAC 540 

TGGTGTAGTC CAAGGAACTT GTATAAATGC AGGACTCATG AATCCTGTAA CTGTTGCTAA 600 

GTAGCTGATT AAAATACCAA GGACTGGAAC TGTGATAAAT GGAATAGTC 64 9 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
.(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



CTTGGTATTA 


TTTGAAATCA 


GATGGTTCTT 


ATGCAAAAAA 


TGCATGGCAA 


GGAGCTTATT 


60 


AC CTTAAATC 


AAACGGTAAA 


ATGGTACAAG 


GTGAGTGGGT 


TTATGATTCT 


TCTTACCAAG 


120 


CCATGGTATT 


ACTTGAAATC 


AGATGGTTCA 


TATGCTCGCA 


ATGCATGGCA 


AGGAAACTAC 


180 


TATTTGAAAT 


CAGATGGTAA 


AATGGCTGTC 


AATGAATGGG 


TTTATGATGC 


CACCTATCAA 


240 


GCATGGTATT 


ATTTGACATC 


AGATGGTTCT 


TATGCTTACA 


GTACATGGCA 


AGGAAATTAC 


300 


TATCCTAAAA 


TCGGATGGTA 


AAATGGCTGT 


CAATGAATGG 


GTTGATGGTG 


GACGTTATTA 


360 


TGTTGGCGCT 


GACGGAGTTT 


GGAAGGAAGG 


TCAAGCAAGT 


AC AGC TTCTC 


CTAGTAATGA 


420 


TAGCAATAGT 


GAATATTCCT 


GCTGCTTTAG 


GAAAGGCAAA 


AAGTTATAAT 


TCGTTATTCC 


480 


ACATGTCAAA 


AAAAACG 










497 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 18 : 



CTAGTCTAAA 


TTTTTTAAAA 


CAAAGGTCAA 


AGATAGTCAA 


TATCAGTAAT 


CATAACTAAG 


60 


TAAACAAAAA 


GAGGTAAAGA 


ATATGAATAA 


CAACTTTAAT 


AATTTTAATA 


ACATGGATGA 


120 


TTTATTTAAC 


CAATTGATGG 


GTGGTATGCG 


AGGATACAGT 


TCTGAAAATC 


GCCGTTACTT 


180 


AATTAATGGA 


CGCGAAGTCA 


CACCTGAGGA 


ATTTGCTCAC 


TATCGTACGA 


CTGGTCAATT 


240 


ACCAGGAAAT 


GCAGAAACTG 


ATGTGCAAAT 


GCCACAACAG 


GCATCAGGTA 


TGAAACAAGA 


300 


CGGTGTCCTT 


GCAAAACTAG 


GTCGAAACTT 


GACAGCAGAA 


GCGCGTGAGG 


GCAAGTTGGA 


360 
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AGAAACATCT 


GAAATCCTCT 


CACGCCGCAC 


420 


AGGTGTTGGT 


AAGACAGCAG 


TTGTCGAAGG 


480 


TCCTGCTGCT 


ATCAAGAACA 


AGGAAATTAT 


540 


TACTCAATAC 


CGTGGTAGCT 


TTGAAGAAAA 


600 


AGCAGGGAAT 


ATTATCCTCT 


TCTTTGATGA 


660 


TTGTGGAGAC 


AGTGGTTCTA 


AAGGGCTTGC 


72 0 


GGAGAATTGA 


CAGTGATTGG 


GGCAACAACT 


780 


AATGCTGCTC 


TTGCTCGTCG 


TTTCAACGAA 


840 


ACTTTTAAAA 


TTCTTCAAGG 


CATTCGTGAC 


900 


CCAGACGAAG 


TCTTGAAAGC 


AGCGGTGGAT 


960 


TTGCCAGATA 


AGGCTATTGA 


CCTTGTCGAT 


1020 


CCAGTAACAG 


ATGTGCATGC 


TGTTGAACGA 


1080 


AAAGCAGTTG 


AAGCAGAAGA 


TTTTGAAGCA 


1140 


TTGGAAAGGA 


AAATCGAAAA 


CCACACAGAA 


1200 


GTGGCTGAAT 


CTGTGGAACG 


AATGACAGGT 


1260 








1282 



ATCCCAGTAT CGCAAATGGG AG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CTTGATATAA TTCTTGTTTC GAAAGATTCT CCTTAGGTAT ATCTATTCCT CCACTAGTAA 
ACGGTAATTC CAAAACAGAG TTTACTTCGT TAAATGTAAG CCAATATTTA ACTTTATCTT 
TATACCTTTC TAAAACTGTT CGAGCAAATT TTTCATAAAA ATGAATCATT CTTCCTATcI 
ATCCATCCAT GATATTTTCT TGCTAAATAT AATGGAGTCT CATAGTGTGA AA^GTtIcA 
AGTGGTTCTA TCC CGTGAGC ATGTAGTTCA TCAAACAATT CATCATAATA TTTCAATCCA 
GCTTCGTTAG GTTCTTCCTC ATCTCCTTTT GGAAAAATTC TACTCCATGC AATAGAAGTA 
CGAAAAACAT TAAAGCCCAT TTCAGAAAAC AAGGATATAT CTTCCTTATA TTTATGATAA 
AAATCAATAC CTATCAATTT TAAGTTATCT TCTGTAGGAT TTTCTGTTGC TTCTCC^ 
CCACCTTTGG GTAACACATC CTGAACTGAT AAGCCCTTAC CATCTTCATT ATATGCTCCC 
TCTACTTGAT TAGCTGCAAC AGCTCCACCC CAAAGAAAAT CATCTGGAAA AATGGTCATA 
ACTTTCCTCC ATTATAATAT TACCAGTAAT TCCTTAGAAA TGCTCGATTG TC^ A " 
AGGTAATATT AATACATCTA GAAAATCATT GGTATTCGTT ACAATTACTG GTGTAACTGT 
TTCGTAGCCT TTAGTCTTGA TTAAATTCAA GTCCATTTCA AAAATCAACT C^tIZ 
AACTCTGTCT CCTTCTTCTA CATGACTAAT AAAACCTTGA CCTTTTAG 
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(2) INFORMATION FOR SEQ ID NO: 20: 



PCT/US97/05306 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



AGCTATGGTC 


AGACTCAGAT 


TGATGGCGTT 


GCTTATGCCA 


AGTACGATAT 


CTTC CGTTT A 


60 


AAGAACGGGA 


AAATTGTGGA 


GCATTGGGAT 


AATAAGGAAG 


TCATGCCTAA 


GGTAGAAGAC 


12 0 


TTGACCAATC 


GAGGGAAGTT 


TTAAATTGAG 


it n it "it f"* Ti it rri 

G AC AAAG AAT 


GATTGAATAC 


it it it it it 

AAAAATGTAG 


ion 


CACTGCGCTA 


CACAGAAAAG 


GATGTCTTGA 


GAGATGTCAA 


C TTACAGATT 


GAGGATGGGG 




AATTTATGGT 


TTTAGTAGGG 


CC TTC TGGGT 


CAGGTAAGAC 


GACCATGCTC 


* is a (Tv^ TV rprp ft 

AAGATGAT I A 




ACCGTCTTTT 


GG AAC C AACT 


GATGGAAATA 


mmm 7s <T1 ft ^VT/"^ ft 

TTTATATGGA 


TGGGAAGt-GC 


ATLAAAUAL 1 


fs n 

j D U 


ATGATGAGCG 


TGAACTTCGT 


CTTTC T AC TG 


GTTATGTTTT 


AC AGG G 1 A 1 X 


GG Lllliv. 


/i 7 n 


CAAATCTAAC 


AGTTGCGGAA 


aatattvjC rc_ 


ILAI \ 1GA 






480 


AAGAAATTAC 


GAAGAAAACA 


GAAGAACTTT 


TGGCTAAGGT 


TGGTTTAC C A 


GTAGCCGAGT 


54 0 


ATGGGCATCG 


CTTACCTAGT 


GAATTATCTG 


GTGGAGAACA 


GCAACGGGTC 


GGT ATTGTC C 


600 


GAGCTATGAT 


TGGTCAGCCC 


AAGATTTTCC 


TCATGGATGA 


ACCCTTTTCG 


GCCTTGGATG 


660 


CTATTTCGAG 


AAAACAGTTG 


CAGGTTCTGA 


CAAAAGAATT 


GCATAAAGAG 


TTTGGGATGA 


720 


CAACGATTTT 


TGTAACCCAT 


GATACGGATG 


AAGCCTTGAA 


GTTGGCGGAC 


CGTATTGCTG 


780 


TCTTGCAGGA 


TGGAGAAATT 


CGCCAGGTAG 


CGAATCCCGA 


GACAATTTTA 


AAAGTGCCTG 


840 


CAACAGACTT 


TGTAGCAGAC 


TTGTTTGGAG 


GTAGTGTTCA 


TGACTAATTT 


AATTGCAACT 


900 


TTTCAGGATC 


GTTTTAGTGA 


TTGGTTGACA 


GCTACAATGA 


CATTGGTCGG 


TTCCTTGAGC 


960 


AAGAGATAGA 


TTAGCCAGAC 


AGTCATGCCC 


AAAATCCCTC 


CAGGTAAGAG 


CATAGACCGT 


1020 


TGCACATTAA 


GTACGATTAA 


AAAAGTGATA 


ATGGCAAGAA 


AACTTGCTAC 


TGCTTGTAAT 


1080 


AAAAAGGTTG 


TTAGTGTCAT 


ATTAGTTCAT 


CAATACCAAG 


GCGACAGAAG 


TTCCTGCCCC 


1140 


TAAAGCGAGG 


GTAATGAGCA 


GGGATTCAAA 


CATCTTACTC 


ATACCAGAGT 


TTATGTGGTT 


1200 


GGTCATAATA 


TCACGGACCG 


CATTGGTCAA 


GGCAATACCT 


GGTACAAACG 


GCATGACCGC - 


1260 


ACCAGCTATA 


ATCAAATCCT 


GCCCGTTTGA 


ATGGAAAAAC 


CCTGTGTTAG 


CCGAGCCCCA 


1320 


AAACTGGGGC 


CAATTTATCC 


CCCAAAGACA 


AAAGCTCCAT 


CAAAGGCTGT 


CACAAAGGGA 


1380 


ATTCGGATAA 


ATTTTCCACA 


TAGAAGGAAA 


AGGCAAAACC 


AAATAAGGTC 


GCCACTCCTG 


1440 


CCCCAAGTGC 


TCGTAAATAT 


TCCGCT 








1466 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



WO 97/37026 

(D) TOPOLOGY: linear 



PCT/US97/05306 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

AAGGTTCTTC GTAAATGTAC ACCATTCGGA TGCTCAAAAT GGGTTCTGAA GCTGCTGCTA 
AATCTGCACA AGAACACGGT CTTAAATCAG TTGAAGTTAC TGTAAAAGGT CCAGGTTCTG 
GTCGTGAGTC AGCTATTTCG TGCGCTTGCT GCCGCTGGTC TTGAAGTAAC AGCAATTCGT 
GATGGGACTC CAGTGCCAAC ACAATGGTGC TTCGTCCTCC AAAACGTCGC CGTGTATAAT 
CATCGCATTA CACTGCTTTT CGTTTAAGAG GGAGTAACTA AATGATCGAG TTTGAAAAAC 
CAAATATAAC AAAAATTGAT GAAAATAAAG ATTATGGCAA GTTAGTAATC GAACC AC TTG 
AACGTGGCTA CGGTACAGCT CTTGGT AAC T CTCTTCGTCG TGTACTTCTA GCTTCTCTAC 
CAGGAGCAGC TGTGACATCT ATCAACATTG ATGGTGTGTT ACATGAGTTT GACACAGTTC 
CAGGTGTTCG TGAAGACGTG ATGCAAATCA TTCTGAACAT TAAAGGAATT GCAGTGAAAT 
CGTACGTTGA AGACGAAAAA ATCATCGAAC TGGATGTTGA AGGTCCTGCT GAAGTAACAG 
CTGGTGACAT TTTGACAGAT AGCGATATTG AAATTGTAAA TCCAGATCAT TATCTCTTTA 
CAATTGGTGA AGGTTCTTCT CTAAAAGCGA CTATGACTGT TAACAGTGGT CGTGGATATG 
TACCTGCTGA TGAAAATAAA AAGGATAATG CACCAGTTGG AACACTTGCT GTAGATTCTA 
TTTATACACC AGTTACAAAA GTCAACTATC AAGTGGAACC TGCTCGTGTA GGTAGCAATG 
ATGGTTTCGA CAAATTAACC CTTGAAATCT TGACAAATGG AACAATTATT CCAGAAGATG 
CTTTAGGGCT TTCAGCACGT ATTTTGACAG AACATCTTGA TTTGTTTACA AATCTTACTG 
AGATTGCTAA GTCAACTGAA GTGATGAAAG AAGCTGATAC TGAATCTGAC GACCGTATTT 
TAGATCGTAC GATTGAGGAA CTGGACTTGT CTGTGCGTTC ATACAAC TGT TTAAAACGTG 
CCGGTATCAA TACTGTGCAT GATTTGACAG AAAAATCTGA AGCAGAGATG ATGAAAGTAC 
GAAATCTTGG ACGCAAGAGT TTGGAAGAAG TGAAACTCAA ACTCATTGAT TTGGGTCTTG 
GATTAAAAGA TAAATAAAGG AGGAATACAT GGCTTACCGT AAACTAGGAC GCACTAG 

(2) INFORMATION FOR SEQ ID NO:22r 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CTATTGAAAC AAGAAAAATA GAGAATCAAA GAAAGAGAAC TTATGAATAT TCAAGAAGAA 
ATTAAGAAAC GTCGTACCTT TGCCATTATC TCCCACCCGG ACGCGGGGAA AACAACCATC 
^»X»Or TACTCTAACT TTGOGCGTGA GATTCGTGAG SCTGCTAC^C TAAAAGGGAA 
GAAAACAGGG ACTTTTGCTA AATCTGACTG GATGGATATC GAGAAGCAAC GTGGGATTTC 
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TGTTACTTCA 


TCTGTTATGC 


AATTTGACTA 


CGACGGCAAG 


CGCGTGAATA 


TCTTAGACAC 


300 


GCCAGGGCAC 


GAGGACTTCT 


CAGAAGATAC 


CTATCGTACC 


TTGATGGCGG 


TGGATGCTGC 


360 


GGTCATGGTC 


GTGGACTCTG 


CCAAGGGGAT 


CGAGGCTCAA 


ACAAAGAAAT 


TGTTTGAGGT 


420 


TGTGAAACAT 


CGTGGCATTC 


CAGTCTTTAC 


CTTTATGAAC 


AAGTTGGATC 


GTGACGGTCG 


480 


TnAGCCTTTG 

X VJnwV_V« XXX s-7 


GATCTCTTGC 

vJ«i X V— i X V> A X VJ V_ 


AAGAATTGGA 


AGAAATCTTG 


GGCATTGCTA 


GCTACCC TAT 


540 




ATCGGGATGG 


GGAAAGCCTT 


TGAGGGCTTG 


TATGACCTCT 


ATAACCAACG 


600 


TTTAGAGCTT 


TACAAAGGGG 


ATGAGCGTTT 


TGCTAGCCCT . AG AAG ATGGA 


GACAAACTTT 


660 


TTGGTAGCAA 


TCCTTTCTAC 


GAGCAAGTCA 


AGGATGACAT 


TGAGCTTTTA 


AATGAAGCTG 


720 


GGAATGAGTT 


TTCAGAGGAA 


GCTATTCTGG 


CTGGAGAATT 


GACGCCTGTC 


TTTTTCGGTT 


780 


CAGCCCTGAC 


AAACTTTGGT 


GTGCAGACCT 


TCCTTGAAAT 


CTTCCTCAAG 


TTTGCTCCAG 

XXX VJV, X >wV- 


840 


AAC C ACATGG 


TCACAAGAAA 


ACAGACGGTG 


AAATTGTGGA 


TCCTTATGAC 


AAGGATTTP T 

*»*»ww*\ XXX V— X 


900 


PAGGCTTTGT 

UAwvV. XXX VJ X 


OTTTAAAATV* 
x> x x n/irm x v— 


CAAGCCAACA 


TGGATC CTCG 


TCACCGTGAC 




y u U 


TTGTCCGTAT 


CGTATCAGGC 


GAATTTGAGC 


GTGGCATGAG 


TGTCAATCTC 


CCTCGTAC TG 


1020 


GTAAGGGTGC 


CAAACTATCT 


AATGTT AC C C 


AGTTTATGGC 


GGAGAGTCGT 


GAGAATGTGA 


1080 


CCAATGCCGT 


AGCAGGTGAT 


ATTATCGGGG 


TTTACGATAC 


CGGTACTTAT 


CAGGTTGGGG 


1140 


AT AC CTTGAC 


GGTTGGAAAA 


AACAAGTTTG 


AATTTGAACC 


ACTGCCAACC 


TTTACTCCTG 


1200 


AAATTTTCAT 


GAAAGTTTCT 


GCTAAGAATG 


TTATGAAGCA 


AAAATCCTTC 


CACAAGGGGA 


1260 


TTGAGCAATT 


GGTGCAAGAA 


GGAGCCGTTC 


AGCTTTATAA 


GAATTACCAA 


ACAGGTGAGT 


1320 


ACATGCTGGG 


AG 
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(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 93 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



TTGGACGCTG 


GAACAAATCC 


AAGCTGACCT 


GTTCCAAGAC 


CAGACTTGGT 


ATGCTCTGGC 


60 


TTATGATGGG 


GCAGAAGTGA 


TTGGCTTTCT 


AGCTGTTCAG 


GAGACTCTTC 


TTTGAAGCAG 


120 


AAGTCCTGCA 


AATCGCTGTC 


AAAGGAGCCT 


ATCAGGGTAA 


GGGCATTGCG 


TCAGCCTTGT 


180 


TTGCTCAATT 


GCCGAC AGAC 


AAGGAAATTT 


TCCTCGAAGT 


CAGACAGTCA 


AATCAACGAG 


240 


CGCAAGCATT 


TTACAAGAAA 


GAAAAGATGG 


CAGTTATCGC 


TGAGCGAAAG 


GCCTACTACC 


300 


ATGACCCAGT 


CGAGGACGCC 


ATTATCATGA 


AGAGAGAAAT 


AGATGAAGGA 


TAGATATATT 


360 


TTAGCATTTG 


AGACATCCTG 


TGATGAGACC 


AGTGTCGCCG 


TCTTGAAAAA 


CGACGATGAG 


420 


CTCTTGTCCA 


ATGTCATTGC 


TAGTCAAATT 


GAGAGTCACA 


AACGTTTTGG 


TGGCGTAGTG 


480 


CCCGAAGTAG 


CCAGTCGTCA 


CCATGTCGAG 


GTCATTACAG 


CCTGTATCGA 


GGAGGCATTG 


540 


GCAGAAGCAG 


GGATTACCGA 


AGAGGACGTG 


ACAGCTGTTG 


CGGTTACCTA 


CGGACCAGGC 


600 


TTGGTCGGAG 


CCTTGCTAGT 


TGGTTTGTCA 


GCCGCCAAGG 


CCTTTGCTTG 


GGCTCACGGA 


660 


CTTCCACTGA 


TTCCTGTTAA 


TCACATGGCT 


GGGCACCTCA 


TGGCAGCTCA 


GAGTGTGGAG 


720 
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CCTTTGGAGT TTCCCTTGCT AGCCCTTTTA GTCAGTGGTG GGCACACAGA GTTGGTCTAT 780 
GTTTCTGAGG CTGGCGATTA CAAGATTGTT GGAGAGACAC GAGACGATGC AGTTGGGGAG 840 
GCTTATGACA AGGTCGGTCG TGTCATGGGC TTGACCTATC CTGCAGGTCG TGAGATTGAC 900 
GAGCTGGCTC NTCTNGGGCA GGANATTTAT GA 



(2) INFORMATION FOR SEQ ID NO: 25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 



932 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

CTAAACTTTC GCTCATAGGC ATACAAATTA ATCCTTTGGC ATAAGTAGCC ATAAAATTAA 6 0 

CATTTTCTGT TGTAGCTGCT TGTGCAGAAC AAATTAAGTC TCCTTCATTT TCTCTATCCT 12 0 

TGTCGTCTAT AACAAGAACA AGTCGTCCCT TCTGCAATGC TTCTAATGCT TCTTGTATTT 180 

TTCGATATTC CATTGACTGA TTATCCTTTC TGCTAAAATC CATTTTGATA TAATAGTTCC 240 

TTCGATATTT CTGATTTTGG AGAGTTATCC ATCAGTTTTT GCACATATTT ACCTAAGATA 3 00 

TCATTTTCAA GATTTACTGT ACTCCCGACT TGTTTACTCT TAAGAATGGT TTGTTTCAAG 360 

GTATGAGGGA TAACAGATAC TGAAAAGTTT ACTTTGGAGA CTTTAGCGAC AGTCAGACTA 420 

ATGCCGTCAA TTGTAATAGA TCCTTTTTCA ACTATTAAAT CTAAAATTTC TTTTTGTGTG 4 80 

TTGATTTGAT ACCATACAGC ATTATCATCT TTTTTTATTG ACGAGATTTT TCCTGTACCA 540 

TCAATGTGTC CTGTAACGAC GTGACCCCCA AGTCGACCGT TGACAGATAA GGCTCTTTCT 6 00 

AGATTCACCT CACTTCCATG TTTTAATAGA GTAAGAGCTG TTCGACTCCA TGTTTCATTC 660 

ATTACATCAA CTGTAAAGGA TTGATGATTG AAATGAGTAA CTGTAAGACA GATACCATTT 72 0 

ACTGCTATAC TATCGCCTAA ATGGATATCC GTTAATATTT TTGAGGCTTT AATTGATAGT 7 80 

TTACAATTAC GAGAGTCTTT CTGTATTCCT TCAACTTTTC CGATTTCTTC AATTATTCCT 840 

GTGAACATGG ATAAATCACT TCACTTTCTA TGAGATAGTC NTTTCCTNTT TGAGAAAAAG 900 

CATAAGGTTT CAATCTAATA GCGTCATTTG GCAAAGAAAT GCCTTCACCT CCGACAGGAA 960 

ACTTGGCACT GCCTCCAAAA ATTTTTGGTG CAATATATAT TTTCAGCTCA TCAACAATTT 1020 

GTTGTTCCAA AGCACTCCAA TTCATTAGAC TGCCCCCCTC TACAACTAGG CTATCAATCT 1080 

GCATGTTTCC TTAGATGTTN GCATTAAACT CNGATAAGTC TATATGATTG CCTTTTTTCT 1140 
TTATGGAAAG TATTCCCC 

1158 



69 



WO 97/37026 



PCT/US97/05306 



(D) TOPOLOGY: linear 
<ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



TAAAGGCTGT 


TTGGATAATG 


TAACATAAAG 


AAATCTTGTT 


TAGCAGGGAT 


TGAATAGAGT 


60 


TCCTGAAGTG 


ATGACTTCTT 


TATTAGACAT 


AATGTCTGGA 


CCCAAGATTA 


AAAGAATGGC 


120 


CCCGAAGAAT 


ACGAGCATCA 


AAGGTAGCAG 


ATGCAACAAC 


TGTATCGTGG 


AAGATTGAGA 


180 


GG AAC TTAGG 


TAATTTAAGA 


TTGTCTAAAC 


TTTCTTCTTT 


CTTACCAAAG 


CGTCCTGCTA 


240 


CTTTATCTAC 


AAACCAGATT 


GCAAATTGCT 


GTTGGTG AC C 


AATCGCAAAT 


CCGCCACCAC 


300 


CAGTCAAGCG 


TTGAGTTGCC 


TCAACAGTCA 


TATTTGAACT 


AACTGCCCAG 


TAAAGTCCAC 


360 


AGATGATACC 


AATCGCTGCT 


GTACCGTAAG 


CATTGCGCAA 


TTGTGGTACT 


AAGAATAGAA 


420 


CCATAAGAGA 


TACTGTTGCA 


GCTAGCCATG 


CTGATCAATT 


CCCTCAACCA 


GTCCTATATC 


480 


TCAGCCAAGA 


GAATCGAGGA 


GGTCTTTGCG 


GAATCTCCCG 


AAAAC ATC C A 


TTCAGAATTA 


540 


GAACAAAAGC 


AAGTTACCAG 


TGGTCGGGTT 


TTAC AAGTCC 


AAGAATTGAC 


TTTTACCTAT 


600 


C CTGATGCGG 


CCCAGCCTTC 


TCTGAGAGAC 


ATTTCCTTTG 


ATATGACTCA 


AGGACAAATC 


660 


CTTGGTATCA 


TTGGGGGGAC 


TGGTTCTGGT 


AAATCAAGCT 


TGGTGCAACT 


CTTACTTGGA 


720 


C TTTATC C AG 


TAGACAAGGG 


GAACATTGAC 


CTTTATCAAA 


ATGGACGTAG 


TCCTCTTAAT 


780 


TTGGAGCAGT 


GGCGGTCTTG 


GATTGCCTAT 


GTACCTC AAA 


AGGTCAAACT 


CTTTAAGGGA 


840 


ACTATTCGTT 


CCAACTTGAC 


TTTAGGTTTA 


AATCAAGAAG 


TATCTGACCA 


GAAACTCTGG 


900 


CAGGCCTTGG 


AGATTGCGCA 


AGCTAAGGAT 


TTTGTCAGTG 


AAAAGGAAGG 


ACTTTTGGAT 


960 


GCCCTAATTG 


AAGCAGGGGG 


GCGAAATTTC 


TCAGGTGGAC 


AAAAACAAAG 


GTTGTCTATC 


1020 


GCCCGAGCAG 


TCTTGCGCCA 


AGCTCCGTTT 


ATC ATCC TAG 


ATGATGCAAC 


CTCGGCACTG 


1080 


GATACCATTA 


CAGAGTCCAA 


GCTCTTGAAA 


GCTATTAGAG 


AAAATTTTCC 


AAACACGAGC 


1140 


TTAATTTTGA 


TCTCTCAACG 


AACCTCAACT 


TTACAGATGG 


CGGACCAGAT 


TCTCCTCTTG 


1200 


GAAAAAGGTG 


AGTTGCTAGC 


TGTTGGCAAG 


CACGATGACT 


TGATGAAATC 


CAGCCAAGTC 


1260 


TATCGTGAAA 


TCAATGCATC 


CCAACATGGA 


AAGGAGGACT 


AGAATGAAAC 


GACAAACTGT 


1320 


AAACCAGACG 


CTCAAACGTT 


TAGCCGTAGA 


TTTAGCAAAC 


CATCCCTTCC 


TCCTTTTCCT 


1380 


AGCCTTTCTA 


GGAACTATTG 


CCCAAGTTGG 


CTTATCAATT 


TACCTACCTA 


TTCTGATTGG 


1440 


GCAGGTCATT 


GACCAAGTCC 


TAGTGGCTGG 


TTCATCACCA 


GT 




1482 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 895 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
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CTCTGAAGGA TCCAGAAAGT CGCTTATATA TTTTATTAAA AGACGGTCAG GTTATTGGGA 
CTTGTACGGT TGATTTATCG ACTAATACGA ATTAC TTCTA CGGTTTAGCA ATATTGGAAC 120 
CTGAACGTGG AAAAGGCTAT GGAAGCTACT TAGCAAAATC CCTCGTCAAC CAACTAATTG 180 
AGCAAAATGA CAAGGAATTT CAGATTGCAG TGGAAGATAG CAATGTAGGT GCCAAACGTT 
TGTATGAAAA AATTGGCTTT GTCAAACAGA CTCAGGTGGT TTATCTGAAT GAGAAAGGAG 
CAAGGGATTC CGAAGTGTAG AGATATTCGG ACTGAAATTT ACTTGAACTT TTAGTGATGA 
AACTAATTGT TCTTGGATTT CAGCTTTCCT GATTATGATT TATGATTAAA ATCTATGACA 
CCATGTCTCG TGATTTGCGA GAATTTGTCC CGATTGAGGA CGGCAAGATC AAGATGTATG 
TTTGTGGGCC AACGGTGTAC AACTATATCC ACGTGGGAAA CGCCCGTTCG ACGGTAGCTT 
TTTTGGATAC GAATTTCGTC GCTATTTTGA GTACCGTGGG TATAAGGTTG CCTATATTTC 
CAATTTTACA GATGTGGATG ATAAGATTAT CAACCGTGCC AGGGAAGAAG GCATCACGCC 
TCAGGAGGTT GCGGATAAGT ACATCGCTGC CTTTCGTGAG GATGTGACGG CCTTGGGCGT 
GAAACCTGCG ACTCGCCATC CGCGTGTAGT GGAGTTTATG GCAGACATCA TCCGTTTTGT 780 
GGAAGACTTG ATCGAGAAAG GCTTTGCCTA TGAGAGTCAA GGGGATGTCT ATTTCCGTGT 840 
AGAAAAATCC CACAACTATG CTAAATTGGC TAATAAAACC TTGGAAGATT TGGAG 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 

( C ) STRAND EDNESS : s ing 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 



(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 



60 



240 
300 
360 
420 
480 
540 
600 
660 
720 



895 



SEQ ID NO: 


27: 






GCAGAACGTG 


TGGCGCAGGA 


GATTGGGATA 


60 


TCAGATGGAG 


AGATTCAGGT 


CAACATTGAA 


120 


CTACAATCAA 


CTAGTTCGCC 


TGTAAATGAC 


180 


GCTTTGAAGC 


GTGCGAGTGC 


AGAATCTGTC 


240 


CGTCAGGATA 


GAAAGGCGAG 


AGCGCGTGAG 


300 


CTTGAAGTAG 


CTGGAGTGGA 


TCGTTTATTG 


360 


GGATTCTTTG 


ATATTCCTGT 


GGATCATTTG 


420 


GAGCGTCCTG 


GTATGGTTGG 


TTCTGACTAT 


480 


ACTCGTGCTC 


GTAAGTTGGC 


AGAATTTTTG 


540 


CGTAGCGTTG 


ATAAGATGAA 


TACTAGTGAA 


600 


AACCACTTGT 


AGCTTCGATT 


GATGATATGT 


660 


GCAGATGCTC 


TTGCGGAAG 




709 
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(A) LENGTH: 816 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 



CTAGAAGAAC 


TCCGTCACAA 


GCTTGTGACT 


TGTAAGCAAG 


AACAGAGCAA 


GATTGAGGAT 


60 


TCCTTATGAT 


TTCATTCCTT 


CTTCTATTGG 


TCTTGGTTTG 


GGGATTTTAT 


ATCGGCTATC 


120 


GGAGAGGCCT 


GCTCTTACAG 


GTTTATTACC 


TGATTTCAGC 


CATGGCATCG 


GCTTTTATGG 


180 


CTGGCCAGTT 


TTATAAGGGG 


CTTGGAGAGC 


AATTCCATTT 


ATTGCTCCCT 


TATGCAAATT 


240 


CGCAGGAAGG 


TCAGGGGACT 


TTCTTTTTCC 


CATCGGATCA 


ACTCTTTCAG 


CTGGATAAGG 


300 


TCTTTTATGC 


AGGTATCGGC 


TACTTGCTTG 


TATTTGGGAT 


TGTCTATAGC 


ATTGGTCGTT 


360 


TGCTTGGTCT 


TCTCTTACAC 


TTGATTC CT A 


GCAAAAAACT 


GGGTGGTAAG 


TTGTTCCAAG 


420 


TTTCAGCAGG 


TATCTTGTCC 


ATGTTGGTGA 


CCTTATTTGT 


CTTGCAAATG 


GCCTTGACAA 


480 


TCTTGGCGAC 


CATCCCCATG 


GCAGTTATAC 


AAAATCCTCT 


TGAAAAGAGT 


ATCGTCGCAA 


540 


AACACATCAT 


CCAGAGCATA 


CCGATAACAA 


CCAGTTGGCT 


CAAACAAATC 


TGGGTGACAA 


600 


ATTTAATCGG 


ATAAAAAGGG 


CAGGAGTTTT 


CCTAGCCCTT 


TGTTTACAGA 


TTTGACTCGA 


660 


ATCTATCAGA 


ATGTAAAAAG 


CTACCACACC 


TAGACATTCA 


AAGACAAGGA 


AATAAAGATG 


720 


AATAAGAAAA 


TATTAGAAAC 


ATTAGAGTTC 


GATAAGGTCA 


AGGCCTTGTT 


TGAGCCTCAT 


780 


TTGTTGACCG 


AGCAGGGCTT 


GGAGCAATTG 


AGACAG 






816 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1001 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



CTTGAAGCAG 


CTGAAAGCAT 


GGGGTCTTCG 


GGATTTAAAA 


CCTATTCGTA 


ACGGGTTGTT 


60 


TTACCTTTCC 


TAGTTCCAAC 


CTTACTAGCA 


GCTCCTTGCT 


TGTATTTATG 


AGAGCATTCT 


120 


CAGACTTTGG 


AACGCCTATG 


TTGATTGGCG 


AAGGATATCG 


GACTTTCCCT 


GTCCTGATTT 


180 


ATACCCAATT 


TATTAGCGAG 


GTTGGAGGAA 


ATTCTGCTTT 


TGCAATTATG 


GCGATTATCA 


240 


TTGCCTTGGC 


AATTTTCCTT 


ATCCAAAAAC 


ACATTGCAAA 


CCGCTACAGT 


TTCAGCATGA 


300 


ATCTGCTCCA 


TCCAATTGAG 


CCTAAAAAAA 


CTACAAAAGG 


AAAAATGGCT 


GCCATTTATG 


360 


CAACAGTCTA 


CGGAATTATC 


TTTATCTCTG 


TTTTACCTCA 


AATCTACTTA 


ATTTATACCT 


420 


CTTTCCTAAA 


AACATCAGGT 


ATGGTATTTG 


TTAAAGGTTA 


TTCTCCAAAC 


AGTTACAAGG 


480 
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TAGCTTTCAA TCGTATGGGA TCTGCTATTT TCAATACCAT TCGTATCCCT TTGATTGCCT 540 
TAGTTCTAGT TGTTCCTATT TACGACATTT ATCTCCTACC TAGCCGTTAG AAAACGGAAT 600 
nGTTTACAA ACTTAATTGA CAGCCTCAGT ATGGTACCTT AT ATTGT AC C AGGAACCGTT 660 
CTAGGGATTG CCTTCATTTC TTCCTTCAAT ACTGGTCTAT TTGGAAGTGG ATTTCTTATG 
ATTACAGGGA CTGCTTTCAT CTTGATTATG TCTCTATCTG TCAGAAGATT ACCGTATACT 
ATTCGCTCAT CTGTTGCTAG CTTACAACAA ATAGCACCAA GTATTGAAGA AGCTGCTGGA 
AAGCTTAGGA AGTAGTCGTC TCAATACCTT TGCTAAGATT ACAACTCCAA TGATGCTATC 
TGGTATCATT TCTGGAGCCA TCTTATCTTG GGTCACAATG ATTTCAAAAC TCTCTACTTC 
TATCCTCCTC TACAATGTCA AAACAAGAAC AATGACTGTA G 



(2) INFORMATION FOR SEQ ID NO: 30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1064 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



720 
780 
840 
900 
960 
1001 



(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

CTGGTTTGCA ATCTATTCCA ACTGAGATTA CAGAGGCAGC AAGGATTGAT GGTGCGACTA 
GCAAGCAAGT TTTCTGGAAC ATTGAATTGC CTTACTTGCT ACCAAGTGTC TCTATGGTCT 
TTATCCTAGC CCTAAAAGGT GGGCTGACTG CCTTTGACCA AGTCTTTGCC ATGACCGGTG 
GTGGTCCAAA CAATGCCACA AC CTC AC TTG GGCTCTTGGT TTATAACTAT GCCTTTAAAA 
ACAACCAATT CGGTTATGCC AATGCCATTG CCGTAATCTT GTTCCTCTTA ATTGTAGTGA 
TTTCGATCAT CCAATTGAGA GTATCTAAGA AATTTGAAAT TTAAGAGGAG AAGCATGATG 
AAACAAGATG AAAGAAAAGC CCTGATTGGC AAATACATTC TATTGATTCT AGGATCGGTT 
CTGATTTTAG TGCCGCTCCT TGCTACCCTC TTTAGTTCCT TTAAAC CC AC TAAGGATATT 
GTAGATAATT TCTTTGGCTT TCCAACCAAC TTCACATGGG ACAACTTTAG CCGTCTCTTA 
GCTGATGGGA TTGGAGGCTA TTATTGGACT CTGTCGTCAT CACTGTCTTG TCTTTACTTG 
CAGTAATGAT CTTTATCCCT ATGGCAGCCT ACTCCATCGC TCGCAATATG AGTAAAAGAA 
AAGCCTTTAC CATTCATGTA TACCCTCTTA ATCCTCGGAA TCTTCGTACC TTTCCAAGTC 
ATCATGATTC CGATTACGGT TATGATGAGT AAACTCGGTT TGGCTAATAC CTTTGGTTTG 
ATCTTGCTCT ACTTGACCTA TGCGATTCCA CAGACCCTCT TTCTCTATGT TGGATATATC 
AAAATCTCGA TTCCAGAAAG TCTGGATGAA GCAGCAGAGA TCGATGGGGC TAATCAATTT 
ACAACCTATT TCCGCATCAT NTTCCCAATG ATGAAACCGA TGCATGCGAC AACCATGATC 
ATCAATGCCC TTTGGTTCTG GAATGACTTC ATGTTGCCAC TCCTTGTCTT GAACCGGGAT 
TCCAAAATGT GGACTCTGCC TTTGTTCCAA TACAACTACG CAGG 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1064 
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(A) LENGTH: 586 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 : 



CTGTAAAAGA 


AGTTAATCAA 


GTTGATAAAG 


C AC TTTT AAA 


ACAAATTGGT 


GCAGTTGATG 


60 


TCTTAGAAGT 


GAAGGGTGGC 


ATTCAAGCAA 


TCTATGGAGC 


AAAAGCAATC 


TTATATAAAA 


120 


ATAGTATTAA 


TGAAATTTTA 


GGTGTAGATG 


ATTAAGTACT 


TACTGACTTA 


ATAAAAAACA 


180 


GAGGAGAGTG 


ATGGATGAGT 


AGGATGAAAT 


GAAATCGCAT 


ACAAGAAATA 


AAGAACTCAT 


240 


TATCCAAGTT 


GGATACGCTT 


ATTACATAGG 


AGAATACAAA 


TGAAATTTAG 


AAAATTAGCT 


300 


TGTACAGTAC 


TTGCGGGTGC 


TGCGGTTCTT 


GGTCTTGCTG 


CTTGTGGCAA 


TTCTGGCGGA 


360 


AGTAAAGATG 


CTGCCAAATC 


AGGTGGTGAC 


GGTGCCAAAA 


CAGAAATCAC 


TTGGTGGGCA 


420 


TTCCCAGTAT 


TTACCCAAGA 


AAAAACTGGT 


GACGGTGTTG 


GAACTTATGA 


AAAATCAATC 


480 


ATCCAAGCGT 


TTTGAAAAAG 


CAACCCAGAT 


ATAAAAGTGA 


AATTGGAAAC 


CATCCACTTC 


540 


CAGTCCAGTC 


CTGAAAAATC 


ACAACAGCCA 


TCCGAAGCAG 


GAACAG 




586 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



TGGTAGTTCT 


TCCTAGTGCA 


GGGGCTGTAG 


ACCCAGTTGC 


GACCCTAGCG 


CTGGACTAGT 


60 


CGAGAGGGTG 


TTGTTGAAAA 


TGGATGGNTA 


TCGCTATGTT 


GGTTATCTAT 


CAGGTGACAT 


120 


CCTC AAAACG 


CTTGGCTTGG 


ACACTGTTTT 


AGAAGAAACC 


TCAGCAAAAC 


CTGGAGAGGT 


180 


GACTGTAGTC 


GAAGTTGAGA 


CTCCTCAATC 


AACAACAAAT 


CAGGAGCAAG 


CTAGGACAGA 


240 


AAACCAAGTA 


GTAGAGACAG 


AGGAAGCTCC 


AAAAGAAGAA 


GCACCTAAAA 


CAGAAGAAAG 


300 


TCCAAAGGAA 


GAACCAAAAT 


CGGAGGTAAA 


ACCTACTGAC 


GACACCCTTC 


CTAAAGTAGA 


360 


AGAGGGGAAA 


GAAGATTCAG 


CAGAACCATC 


TCCAGTTGAA 


GAAGTAGGTG 


GAGAAGTTGA 


420 


GTCAAAACCA 


GAGGAAAAAG 


TAGCAGTTAA 


GCCAGAAAGT 


CAACCATCAG 


ACAAACCAGC 


480 


TGAGGAATCA 


AAAGTTGAAC 


CACCAGTAGA 


ACAAGCAAAA 


GTCCCAGAAC 


AACCCGTGCA 


540 


ACCTACACAA 


GCTGAGCAAC 


CAAGTACACC 


AAAAGAATCA 


TCACAACAAG 


AAAATCCTAA 


600 


AGAAGATAGG 


GGAGCGGAAG 


AGACACCGAA 


ACAAGAAGAT 


GAACAGCCAG 


CAGAAGCCCC 


660 


AAGAAATCAA 


GGTTGAAGAA 


CCAGTAGAAT 


. C AAAAGAGGA 


GACTGTTAAT 


CAACCTGTTG 


720 



74 



WO 97/37026 PCT/US97/0S306 

AACAACCAAA AGTGGAAACG CCTGCTGTAG AAAAACAAAC GGAACCAACA GAGGAACCAA 7 80 

AAGTTGAAGT AACAAGTATT CCCCAAACTA CTCGCTATGA GGAAGACCTT ACTAAGGAAC 840 

ACGGAACGCG TGAAGTTGTT AAGGAAGGTA AGAATGGCAG TAGAACAGTT ACTACTCCAT 900 

ATATCTTGAA TGCGACAGAT GGTACGACTA CAGAAGGCAC TTCGACAACT GATGAAGCTG 960 

AGATGGAGAA AGAGGTTGTT CGTGTTGGCA CGAAACCCAA AGAAAAATTA GCTCCAGTCT 1020 

TAAGTTTGAC AAGTGTTACA GATAATGCAA TGTTGCGTAG TGCGAGACTT ACTTATCATT 1080 

TGGAAAATAC AGATAGTGTT GATGTGAAAA AAATTCATGC TGAAATTAAA AATGGCGATA 1140 

AGGTTGTCAA AACTATTGAC TTATCTAAAG AGAGATTATC AGATGCTGTT GACGGTCTTG 12 00 

AACTTTATAA AGATTATAAG ATTGTGACGA GTATGACCTA TGATAGAGGT AATGGTGAAG 1260 

AAACCTCTAC GTTGGAAGAA ACTCCACTAC GATTAGACCT CAAGAAGGTT GAATTGAAAA 1320 

ACATCGGCTC TACTAATCTC GTCAAAGTAA ATGAGGATGG TACTGAGGTG GCAAGTGACT 13 80 

TCTTAACAAG TAAACCTGTG GATGTGCAGA ATTACTACCT CAAAGTAACT TCCCGTGATA 1440 
ATAAAGTTGT TTCCCCTCCC AGTTGAAAAA ATTGAAGAGG TGACTGAGGA AGGTCCACCA 
CTTTACAAAG TCCCTGCTAA GGCCCTAATT TGAT 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CGGATTTGTT TATCTCAAGA AGAGTCGCGA TATTCTCCGT GAAAGTTCAG AATTGATTAN 
CCAAACGGTA GAAGATTATC TTCAAGGAGA TGACTTTGAC TGGGCAGATC TTAAAGGGAA 
GGTTCGANAT AATTTGACCA AGTATCTCTT TGACCAAACC AAGCGTCGTC CANCTATTTT 
ACCAGTAGTC ATGGAAGCAA AATAATCGTT GAAATAAACA GAGAGAAAGT CGAGTTTCGG 
CTTTTTCTTA TAGAAAAATA GAAGGAGAAA ATCATGGCAG TAATGAAAAT CGAGTATTAC 
TCACAAGTAT TGGATATGGA GTGGGGGGTG AATGTCCTCT ACCCTGATGC CAATCGAGTG 
GAAGAACCAG AGTGTGAAGA TATTCCCGTC TTGTACCTTT TGCACGGGAT GTCTGGAAAT 
CATAATAGTT GGCTTAAGCG GACCAATGTA GAACGCTTGC TTCGAGGAAC TAATCTCATC 
GTTGTTATGC CCAATACCAG CAATGGTTGG TACACCGACA CCCAGTATGG TTTTGACTAC 
TACACGGCTC TAGCAGAGGA ATTGCCACAG GTTCCTGAAA CGCTTCTTCC CTAATATGAC 
TAGCAAGCGT GAAAAGACCT TTATCGCTGG TCTTTCTATG GGAGGCTACG GCTGCTTCAA 
ACTGGCTCTT ACGACAAATC GTTTTTCTCA TGCAG 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH : 1096 base pairs 



1500 
1534 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
695 
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(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



CTCATCCAAG 


AATAGCATAA 


AAGAACTCTT 


CTGTTTTTTC 


TTGAGGAACG 


AGAAAGTCAG 


60 


C AATCTC CAT 


TCCACGATCC 


ATCAAATCCT 


CTAAAGACAT 


CGTGATTTTT 


AAAGTAGTAT 


120 


CACTAATTTG 


TTTCATTTTC 


ATTGCTAGTA 


ACCTCATACT 


TTCAGTTCTA 


TCTATTATAC 


180 


TAGATTTTTA 


CGATTTTATC 


AAAAGAAGGC 


TCCTCTATAC 


GGATAGATTT 


TCCCTAGGGT 


240 


CTTTCTATAG 


GAGACTCCAA 


GAGAAAATTT 


CTGCAGACAG 


ATAGAAAAAG 


CCTTCAAAAT 


300 


CGGCTCTTAG 


CCGATTTTGA 


AGACCTTATA 


CATCAGAATA 


CTTATAATTT 


AAAGGTTGCT 


360 


ACACCGAGGA 


TAGAACGATT 


TAAGTTTCTG 


AGAATTTGAA 


GACTTTGCTC 


AAATTTCTTA 


420 


TAACGAATTC 


ACTCCGTACT 


CTTCAACAAG 


AAGGACTGTA 


TCTCTTTCCA 


AAAGAGATGA 


480 


TACATCCTGC 


AAATCTACAA 


ATGCATTCCT 


TTTAAAGCTT 


CTTGACTCTG 


TTTCAATTTA 


540 


TCTAGGATAG 


CTTTATTTGA 


GCTAACGATG 


GTCAATTCCT 


GTCCAGTATT 


TTTGTATGAC 


600 


AAAACATCTG 


CTAGGTTAGC 


AATTGTTGTA 


ATCTCTGTTA 


CAAAATCAAT 


TTGATACTGA 


660 


GAAAAATCAC 


CTACTCTATT 


GATTGTTGGA 


TTAAAGAGAT 


AAACTAACAC 


ATTTCCCATC 


720 


ACAACCAAAA 


TCACACAAAC 


CACTCCAATA 


ACAACTAAAC 


GAAGAATCAG 


ATTTTTCACA 


780 


TTTAAGCCAA 


GCGCTGTTTC 


AC C ATTTGC G 


TTCAATTCTT 


TAGAGTTGAT 


GGTTTCCAGT 


840 


TTTTCAATTT 


TCACATTTGC 


ATAAGCATGT 


TTAAATTTCT 


CAATCAACCC 


ATCAATTTTT 


900 


TTCTCTAACA 


AGTCATTGGC 


ATCTTTACTT 


GATGTCAAAA 


TTTTCACACC 


AACCCCTGCA 


960 


TCGTCAATCA 


TATAGTAGAC 


GGTCAATTTT 


TTCCACCAAT 


AGTCATTCGT 


TGAATTTTTC 


1020 


AAGGTTGTTT 


CCGTCGTGTC 


TAATTCACTG 


GCAATTTTTT 


TCAACTCACT 


GGGTTCTACA 


1080 


TCATTGAAAA 


GATAAG 










1096 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1037 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 5 : 

TCCCCATCCC AGTGGGCAGC AGCATTAGTC TGTACTTCTG CGTCGTTGCG TTTTATTAGA 60 

AGACTGATTT TGCTTTGAAC TACTTGAAGC TGTCGTTGTA TCCTGCGAAG TTTTTGCTAA 120 

CTTCAGTTTC ATTTGCAGCG ACATGACTAG TTGCCAAGCC TAGTAAACAG ATACTTGCTA 180 

ATCCAATTTT TGTTTTCAAT CTTTCCTCTC CTATAAAAAA TGTAACAGAC ATCTGAATGC 240 

76 



300 
360 
420 
480 
540 
600 
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720 
780 
840 
900 
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TGTTCCACCT AGCTTTTGCT ACTTACTGAT TATTTTACAA AGTCAAGCAA AGCCAAGAAG 
CTTTCAGCTT CAAGTGACGC ACCACCTACA AGGGCACCGT CAACGTCTGG GCAAGCCATG 
TATGAAGCAA CATTTTCAGG TTTAACAGAA CCACCGTATT GAACACGAAC TTTGTCTGCG 
ACTTCTTGAC CAAAGTCAGC AGCTACAACG TCACGAACAA CTTTACACAT TTTTTGTGCA 
TCGTCTTGTG AAGCTGATTT ACCAGTACCG ATAGCCCAGA TTGGCTCATA AGCGATAACT 
GAGGCAGCAA CTTGTTCAGC AGTCAATCCA GCCAATGCAG CAGATACTTG AGCACCTACG 
AATTCAGCAG CTTTACCAGC TTCGTAAGTT TCAAGTGATT CACCACAACA GATGATTGGA 
AGCATACCGT TCGCAAAGAT TGCTTTTGCT TTTTTGTTGA TATCTTCGTC AGTTTCATGG 
AAGTAGTCAC GGCGTTTCTG AGTGACCGAT AACAACGTAG TCAGTACCGA TTTCTTTCAA 
AACTTGTGGG CTAGTTTCAC CAGTGAAAGC ACCTGCATTT TCTCCGTAGC AGTTTTGAGC 
AGCGACTTTT AAGTTTGAGC CTTNGGCAAC AGCAAGNACA GTTGTCAAAT CAAGAGCTGG 
AGCAGNGATA CCTGCTTCAA CAAGATCTGA TGAAGGAAGT TTTGATGCAA CTGCTTCAAC 
GAATGNTCCA GCTTNTTGGG GATTTTTGTT CATTTTCCAG TTACCAGCGA TAAATGG TTT 1020 
ACGTGACATT TCACATA TT 1020 

1037 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 1340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

CTTATCGTAC TAAGGATGGC AGTGTTCAAC TGTTCCGTCC TGATGAAAAT GCTAAACGCC 
TGCAACGTAC ATGTGACCGT CTCTTGATGC CAACAAGTTC CGAACAGACA TGTTTGTAGA 
AGCTTGTAAA GCAGTTGTCC GTGCGAATGA AGAATACGTA CCACCATACG GAATAGGT^ 
AACTTTATAT CTTCGCCCTC TTTTGATTGG TGTCGGAGAT ATTATCGGGG TA^CC^ 
^ GTAC AT1TTCACCA ^TTTGCTAT GCCAGTTGGA AATTACTTTA AAGGTGGTTT 
GGTCCCAACC AACTTCTTGA TTCAGGATGA GTACGACCGT GCAGCACCAA ATGGTACAGG 
TGCGGCTAAG GTTGGTGGAA ACTATGCTGC AAGTCTCTTA CCAGGAAAAA TGGCCAAGTC 
ACGCCATTTC TCAGATGTTA TCTATCTGGA CCCATCAACT CATACAAAGA TTGAAGAAGT 
CGGATCACCT AATTTCTTTG GAATTACAGC TGATAATGAA TTTGTAACAC CA^CC 
ATCTATCTTG CCATCTATTA CCAAGTATTC CTTGCTTTAT TTGGCAGAAC ATcIcTTO^G 
ATTAACTCCT ATTGAGGGTG ATGTTCCAAT TGATAATCTT GACCGTTTTG TAAAGGCAGG 
TGCCTGTGGT ACAGCAGCGG TTATTTCTCC AATTGGAGGT ATTCAACATG G^GA^T 
CCATGTTATT CTATAGTGAA ACAGAAGTAG GTCCTGTGAC ACGTAAATTA TAT^gHt 
^CGCTAT TCAGTTTGGC GATATTGAAG CGCCAGAAGG TTGGA^TA ZZZ^ 
AAAATAAACC AAAGGAGATT TTTTATGAAA TAGAAAAAGT GGCTCTTAAC AGCAGGAGTG 
GTCCTGAGCA CGTCAGCTAT TTTAGTGGC T TGTGGAAAAA CTGATAAAGA AGCAGATGCA 
CCGACAACAT TTTCTTATGT CTATGCAGTA GATCCAGCAT CATTGGGCTA CAGTATAGCG 
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ACTCGAACAT CGAGGACAGA CGTTATTGGA AATGTTATTG ATGGTTTGAT GGAAAATGAT 1080 

AAATACGGCA ATGTTGCTCC TTCTCAAAAA GACTATGATT TGAACAGTAC AGGATGGGCT 1140 

CCAAGCTATC AAGATCCAGC GTCTTACTTG AATATTATGG ATCCAAAATC TGGTTCTGCC 1200 

ATGAAACACC TTGGCATTAC GAAAGGAAAA GATAAGGATG TTTGTAACTA AACCTGGTTT 1260 

GGATAAATAT AAGAAATTGT TAGAAGATGC TGTTTCTGAG ACCACTGACC TAGAGAAGAG 1320 

ATATGAAAAA TATGCCAAAG 1340 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



CTCTGACGGC 


GTCGCCACTT 


AAGAAGAGTA 


TCAAAAAGAA 


AAATAGAAAA 


TTAACTAACA 


60 


AGGNAGAAAA 


ACACATGTCT 


AAAATTATCG 


GTATTGACTT 


AGGTACAACA 


AACTCAGCAG 


120 


TTGCAGTTCT 


TGAAGGAACT 


GAAAGCAAAA 


TCATCGCAAA 


CCCAGAAGGA 


AACCGCACAA 


180 


CTCCATCTGT 


AGTCTCATTC 


AAAAACGGAG 


AAATTATCGT 


TGGTGATGCT 


GCAAAACGTC 


240 


AAGCAGTCAC 


AAACCCAGAT 


ACAGTTATCT 


CTATCAAATC 


TAAGATGGGA 


ACTTCTGAAA 


300 


AAGTTTCTGC 


AAATGGAAAA 


GAATACACTC 


CACAAGAAAT 


CTCAGCTATG 


ATCCTTCAAT 


360. 


ACTTGAAAGG 


CTACGCTGAA 


GACTACCTTG 


GTGAGAAAGT 


AACCAAAGCT 


GTTATCACAG 


420 


TTCCGGCTTA 


CTTCAACGAC 


GCTCAACGTC 


AAGCAACAAA 


AGACGCTGGT 


AAAATTGCTG 


480 


GTCTTGAAGT 


ANGACGTATT 


GTTAACGAAC 


CAACTGCAGC 


AGCTCTTGCT 


TATGGTTTGG 


540 


ACAAGACTGA 


CAAAGAAGAA 


AAAATCTTGG 


TATTTG AC CT 


TGGTGGTGGT 


ACATTCGACG 


600 


TCTCTATCCT 


TGAATTGGGT 


GACGGTGTCT 


TCGACGTATT 


GTCAACTGCA 


GGGGACAACA 


660 


AACTTGGTGG 


TGACGACTTT 


GACCAAAAAA 


TCATTGACCA 


CTTGGTAGCA 


GAATTCAAGA 


720 


AAGAAAACGG 


TATTCGACTT 


GTCTACTGAC 


AAGATGGCAA. 


TGCAACCTTT 


GAAAGATGCG 


780 


GCTGAAAAAG 


CCAAGAAAGA 


CCTTTCTGGT 


GTAACTTCAA 


CACAAATCAG 




830 



(2) INFORMATION FOR SEQ ID NO : 3 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 667 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



PCT/US97/053O6 



GTGGTGGTTT GACTGCCCTT ATCGAAGCAG GTTTCGAANT CTAAGACANA AGCAGGTTAC 
GCTCCAGAAT TGGCTTAACT TTGAAGTTCT TCACGGAAAT GAAATTGATC GTTGACTTGA 
TCTACGAAGG TGGATTCAAG AAAATGCGTC AATCTATTTC AAACACTGCT GAATACGGTG 
ACTATGTATC AGGTCCACGT GTAATCACTG AACAAGTTAA AGAAAATATG AAGGCTGTCT 
TGGCAGACAT CCAAAATGGT AAATTTGCAA ATGACTTTGT AAATGACTAT AAAGCTGGAC 
GTCCAAAATT GACTGCTTAC CGTGAACAAG CAGCTAACCT TGAAATTGAA AAAGTTGGTG 
CAGAATTGCG TAAAGCAATG CCATTCGTTG GTAAAAACGA CGATGATGCA TTCAAAATCT 
ATAACTAATT AGAAATATAT AGCGCTGGAG ATGATTTTAT GAAAAAGATT ATGAGAAAAA 
T TGC ATCGTT ATTATTGGTT CTAGTTGTAT AATGTAATTA CACCGTCGGT AATAGTGCTA 
GCAGACCAAA ATAAAGCAGA TTGGTCGTAT GATGAAAATA CTGTAATTAA CATTTATGAT 
GATGCTAATT TTGAAGATGG TAGGTTGCAT ATGACCTTTG AACAATTCTT CAAATTGGCA 
CAAATAG 

(2) INFORMATION FOR SEQ ID NO : 3 9 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

CGGTTCCAGA NACTGGTATG GCTAGGAGTC GCGACAAAGT GTATCTTATG ATGGTCGATT 
TGATACTCAA TACGCTCAAT ACTTACGGAA AGAATAAACG ATCATTGTCA ACCGGAATAG 
TATGATAATC GTTTCTTGGT GGAATAAGTT CAATCTTATC CTTCTTAGGC ACAATAATGG 
AAGAGCCCAG TGTTCGATAA ACACGATTAT TAAGGCTGGC AATTTCCGTT AATTGCAAAG 
CTTCAATGGT AGGGTGTAAA ACAGCACCAC CAAGAGACTA . GTTATAGGCA GTACTACCAG 
TCGGTGTCGA AACTGTTAGC CCGTCTCCAC GAAAACGTTC AAAGGGAACA CCATTTATTA 
CAATATCTGC CACCATGGTT CGATCAGACC TGCGGATGCT GGCTTCGTTG AGTGCTCTGA 
AAATCTTAAC TTCACCATTT TCAAGAAAGA CCTTCACATT CAGAACAGGG TAAGAAACCC 
TTGCCCCAGT ATCTAGCTGC AAATTAGTCA CTAGCTTGTC CAACTCAAAA TCACGATAAT 
CTGTATAGAA GCCCAAATGT CCAGTATGAA GACCGATAAA GCGGACCTTG TCAAGCTGAT 
TTTCGTACTT ATGAAAGGCC GACAAGAGCA TACCATCCCC GCCAATGGAA ATGACAATAT 
CCGGATTGGT ATCATTGAGT TATAAACTGA TTTCTCTTCA AACGATCTCG CAATTCATAC 
AAAACCCTTT GACTCTGCGG TTTTCTATTG GCTATCAGAT CAATTCGTTT ACCTGTATTC 
TTCATCTGTA TCGTCACTGN NTCCAACACC GTCATTTAAT l^TCTACTCA AAGGATCAAA 
AAGTGCCTGG GCTTCTTGGA TATCATCACG AATTTCACCC ATTTCTTCAT CCAACTGATG 
GGCGATTCTA GCTGTAATTT CCAGTCGCTT CTTAATCTCA TCTGGGAAAT CCCCTTGGTA 
CTTGTAGTTG AGAGAATGTT CTATCGTTGC CCAGAAATTC ATGGCCAAAG TACGAATTTG 
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AATTTCTGCC AAAATAGTCT TAGCTCCATT 
ATGATAGGAA CGATAGCCTG ATGCTTTTCT 
TCGCATATCC TGACGCTTGT GCAAAATATC 
AACCATCACA CGTAAGCCAG CAATATCCTG 
GCCACGACGA GCCATTTTTT CTTTGATGCT 
CTCAATTGGA GAATGCTTAT TTTGCTTACG 
CTTTAACTCA CCAACAG 



PCT/US97/05306 

GATGGTATCA AC CGT ATATT CTACTACCAC 10 80 

ATGAGTAATG TAATCTCGCT CCTGTATGAT 1140 

CACTACTTCC TTGACGTCAT CTACAAACTG 12 00 

CAAATCGTGT TCCAAGGTCG CATAAGTAAT 1260 

CTCAATTGGC TTGACTCGAC CGGTCACAAA 13 20 

ATATTGCTTA CGAATACCAC GAAGTTTAAT 13 80 

1397 



(2). INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 



CTCCCATGAC 


ACCTGCACGC 


TCCGTACGGG 


CCACGTGCTT 


TTTAATCCGT 


TCAACAACTT 


60 


CATAACCCGC 


TTCAACATCC 


ACACCCGACT 


GGGCATATGC 


ATTTTTATTT 


GTCATTTGTT 


120 


TTTTCCTTTT 


CTTTAATGGA 


GAATCTGTCG 


CCTACTTGTA 


AAAACTGGTC 


TTTTCTTCCA 


180 


AACTTCTACG 


ATAGTCTTCT 


TCGTAGTCAT 


AGAGAGGCGT 


TGGGTAGTCA 


CCGTCAAAGT 


240 


AAGCGACACA 


GAGACCACCG 


TTCGGCGCAT 


CTGTTTCGAT 


AC CAATCGAC 


TCAATCAAGC 


300 


TATCAATTGA 


AAGATAAGTC 


AGACTGTCCG 


CACCAATGAT 


TTGGCGAGTT 


TCTTCGACCG 


360 


TATGATTGGC 


TGCAATCAGC 


TCCTGACGGG 


TCTGGATATC 


AATC CCGTAG 


AAACATGGAT 


420 


ACGCTAGTGC 


AGGACTTCCA 


ATGGCAACGT 


GAACCTCAGT 


CGCACCCGCT 


TCTTTCAAGA 


480 


GCTGAACGAT 


ACGACGAGAG 


GTTGTTCCAC 


GTACAATGGA 


ATCATCCACC 


ATG AC C AC AC 


540 


GTTTGCCTTT 


GACAACACCC 


GAAACAGCAG 


AC AGTTTC AT 


CCGCACTCCT 


TGCTCCCGCA 


600 


ATTCTTGAGT 


CGGTTGGATA 


AAAGTTCGCT 


GGGTGTATTG 


GTTTTTGATC 


AGACCCATTT 


660 


CATTTGGTAA 


GCCTGATTCT 


TCCGCAAATC 


CCATAGCCGC 


GCTTAGGGAA 


GAATTGGGCA 


720 


CACCAACTAC 


AATATCTGCC 


TCATGC TTAA 


ATTCTCGCGC 


CAATTGCGCT 


CCCATTCTCT 


780 


TACGTGCCGT 


ATGGACATTG 


ACCCGTGGAT 


ATTAGAATCA 


GGGC 




824 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: 



Genomic DNA 
80 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CTTCTTCCTT GGTACTTCCT CAATTGGATT ACCTTTGGCA TTGCTAGTTT TTCACTCCTA 
CCTTATATCC AAGGCACTAA AATTATGTTT TACCGAGCAG TATTGGCTCG AAAACGTCCA 
AAAGCTTGAA GGTTTTCCCC TCAAGCTTTT TTCTATCAAT GAATTTCTCC GCCTACCAAC 
GTGAGTTCCT GATCTCCATG TTCTATCATA GCGCCGATTG CTGCTTCTAT CCCTCGCCGA 
ATATCCACTA AACTCATAGT TGGAGTAGTC GGTCTGTTCA CCACCTGTTC CATCATATAA 
GGAATATGCA TAAAACCTGC CITAACATAT GGGAATTTCT TTTCTACCAA ATAGAGAGCC 
TGATACATCA AATGACTGCA AACAAAAGTC CCTGCACGTA TTGGAAACAG ACTGCCGGTA 
AGTC CTTCTT TTTTATTAGC TTGTACCATC GCTTTGATAG GTAAACTACT AAAATAGGCC 
GATGCTCCAT CAATACGAAT CGGTGTATCA ATTGGTTGAT TGCCTTCGTT ATCAGGTATG 
CGAGCATCAT CTTGATTAAT AGCCACTCGT TCAGGTGTTA AGCCGGTCCT GCCGCCTGCT 
TGTC C AATAC AAAGTACAGC ATCTGGTTGA TATCGTAATA TTTCTGCCTC TAAAACTTCT 
GACGACTTAT AAAAAACCGT TGGAATTTCT ACCCAGCGAA CTTCAGCCCC ATTAATCTCA 
GATGGTAATA ATTTTACAGC CTCCAAAGCT GGATTAATCT TTTCACCTCC AAAAGGATTA 
AAACCTGTAA CCAATATTTT CATTTTATTT TCCTTTACTA AAATGCGAGA AAGTACATTA 
AGAATATGTG AATAACAATC ATTACTAGAG CAACACCTGC TTGAGCCTTT ATAACGCCAT 
TCTGATCTTT CATATCCATC AATGCTGCTG GTAGAGCGTT AAAATTAGCA GCCATTGGGG 
TCAATAAGGT CCCACAATAA CCTGCTGTCA TGGCAAGAGC ACCAGCCACA ATTGGATTAG 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

CTCGTAAAAA CACTAAAGGA GGTAAATAAC CAATGTTAGT ACCTAAACGT GTTAAACACC 
GTCGTGAATT CCGTGGAAAA ATGCGCGGTG AAGCAAAAGG TGGAAAAGAA GTAGCATTCG 
GTGAATACGG TCTTCAAGCT ACAACTAGCC ACTGGATCAC TAACCGCCAA ATCGAAGCTG 
CTCGTATCGC CATGACTCGT TACATGAAAC GTGGTGGTAA AGTTTGGATT AAAATCTTC C 
CACACAAATC ATACACTGCT AAAGCTATCG GTGTGCGTAT GGGATCTGGT AAAGGGGCAC 
CTGAAGGT TC GGTACCACCA GTTAAACGTG GTAAAGTGAT GTTCGAAATC GCTGGTGTAT 
TGCACGTGAA GCCCTTCGAC TTGCTAGCCA CAAATTGCCA GTTTAAATGT 
AAATTCGTAA AACGTGAAGC AGAATAAGGA GAAGGCATGA AACTTAATGA AGTAAAAGAA 

tt^aag aacitcgtgg TCTTIOTC aa gaagaactcg cgaagggcga ££££ 

AAAAAAGAAT TGTTTGAACT TCGTTTCCAA CCTGCTACTG GTCCAATTGG AAACAAACAG 
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(2) INFORMATION FOR SEQ ID NO: 43: 



PCT/US97/0S3G6 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 base pairs 

(B) TYPE: nucleic acid 

( C ) STRAND EDNES S : s i ng 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



CTTCACjCGAC 


A1AIL 1A1LL 


ALAAL 1 i V- 1 V~ 




TTrrTTTra a 

1 1 1 V~ 1 oiVA 


A ATrrPTP A A 
AA 1 \jV_ 1 bAA 


D U 


AI H 1AA1 i\3 


AV- lfifii i lun 


rpp fTPl & *T A P P 




TA AP A A A APT 
l/U\LAAnAL 1 


TP A AP ATPPP 




AAAtAi 1 1vjV_ 


T & A/T^"2 R Til 


VjljiL 1 1L1AI 






f^TV^TPTTP AT 


1 ftf! 

-LOU 






ptpp.p a a a^T 


P A PP. A TP ATT 


PTPTA A A A A A 


AT APPTTPTP 


'-t yj 


TCTGAAAAGT 


G ACTG AC CGA 


ATGGGGAAAG 


AAGC TGTTC C 


TCTCTCAAAA 


CTAGTAAACA 


Jul) 


ATGCACGCGC 


AATCCCCTGA 


CGGCGATGAC 


CTGGATGAAC 


CAGTATCGTC 


ACTTCTACAT 


360 


CTTGGTCATC 


TGCATAGACA 


GTTAATAAAC 


CAACAAGTTC 


GCCTTTTTCA 


TAATAAAGGA 


420 


AAAAGGCGGG 


CATGTTTGGA 


TCAAAATTAA 


GCATGTTAGA 


GAGATAGGGA 


TCGCGATAGG 


480 


TACCGTCATA 


GTTTTGGCAA 


CAGTTAATTA 


CTTTTTTCGC 


CTCAGATAGC 


TCTTCTTGGC 


540 


TTAACTTGTT 


TCTTGCTTGA 


ATCATATAGG 


TATC CTCTAC 


AAACCAGACG 


ATCTGTGACT 


600 


GGCATCTTTA 


GCCTGCTCGA 


GTTTATTGAC 


ATAATACTC C 


CGTTTTTCTT 


CGACTTCGTG 


660 


AATAACAGGC 


TCATCTTTCT 


TACCATGAAG 


ACGGACAATC 


TTGGCCGGAA 


TACCGACAAC 


720 


CGTCACATCA 


CTAGGTACAT 


CTGCT AC GAC 


AACTGCTGCA 


GCACCGACCT 


TGGCATTTTC 


780 


ACCAATTTCC 


ACAGGCCCGA 


TAACTTGGGC 


ATGGGCTGAT 


ATGAGGGCTC 


CCTTTCGTAC 


840 


AGTCGGATGG 


CGTTTGCCAC 


AGTCTTTCCC 


TGTTCCCCCG 


AGAGTCACTC 


CGTGATAGAG 


900 


AAGAACGCCT 


TTTTCAACAA 


TCGCTGTCTC 


TCCAATCACC 


AGACCAGAAC 


CATGGTCAAT 


960 


AAAGACACCT 


GAATCAATCT 


GGGCTCCTGG 


ATGAATCTC A 


ATCTGAGTCC 


AAAAGCGCCA 


1020 


AAACTGACTG 


TGCATACGAG 


CTAAGAGTTT 


GAAGTCGTAC 


TTCCAGAGAA 


AATGC GAG AG 


1080 


ACGGTGGGCC 


GCCAAGGCCT 


TGACACCTGG 


ATAAGTCAGC 


AAAACCTCCA 


AAGTGGTGCG 


1140 


GGC CGCTGGA 


TCATTTTCTT 


TTACGATATC 


AATGGTTTCG. 


CGCCACCACC 


CCATACATTT 


1200 


CTCCTTTTCT 


TACTCTGAAT 


CTT 








1223 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

GGTGAGTGC C TTTATTAGCG GAGCTTCTCA AGTGATTGTT GCCTTGATTA TCTTCCTTTC 
ATGCTCTTTT ATCTCTTGCG TGATGGGAAA GGCTTGCGTA ACTATTTGAC CCAATTCATT 
CCAAGCAAAT TGAAGGAACC TGTTGGACAA GTTCTATCAG ATGTGAATCA ACAGTTGTCC 
AACTATGTTC GAGGGCAAGT GACAGTGGCT ATTATTGTAG CAGTAATGTT TATCATCTTC 
TTCAAGATTA TTGGTCTACG CTATGCGGTT ACGCTGGGGG TTACTGCTGG TATTTTAAAT 
CTGGTCCCTT ATCTTGGTAG CTTTCTAGCC ATGCTTCCTG CCCTAGTATT GGGTTTGATT 
GCTGGTCCAG TCATGCTTTT GAAAGTAGTG ATTGTCTTTA TTGTAGAACA AACTATTGAA 
GGCCGTTTTG TCTCTCCATT GATTTTGGGA AGTCAATTAA ACATCCACCC TATTAATGTT 
CTCTTTGTTT TGTTAACTTC AGGATCTATG TTTGGTATCT GGGGAGTTTT ACTTGGTATT 
CCGGTTTATG CCTCTGCTAA GGTTGTCATT TCAGCCATTT TCGAATGGTA TAAGGTAGTC 
AGTGGTCTAT ATGAATTAGA GGGTGAGGAA GTCAAGAGTG AACAATAGTC AACAGATGTT 
ACAGGCTTTG GAGGAGCAAG ATTTAACTAA GGCTGAGCAT TATTTCGCCA AAGCTTTAGA 
AAATGATTCA AGTGATCTTC TGTATGAGTT GGCAACTTAT CTTGAAGGGA TTGGTTTCTA 
TCCTCAGGCC AAGGAAATTT ACCTGAAAAT TGTAGAAGAT TTTCCAGAGG TTCATCTTAA 
TCTAG " ^ 



(2) INFORMATION FOR SEQ ID NO: 45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

cl^r ATCTTCTTCT A ^TCCT GTTCCTAAAT ATGTTGGTAG AAAAATGAAG 
CAGTTGTTAC GAGTGTCTGT ATTGGAAAAA AATTTTGTCT ATACAATTCT TTAAATGATA 
TGAACACTAG GAATGCCAAA ACACTOACAT ATTTCGTTTA AAGCCTATCC AGTCTCTTCA 
ATTGTATTTG TAGAAATGCT ATCAGTAATA TATTCGCGAA CACTATGATT TACAACAGTG 
TGGTCGTACA TCTTTGTAAC TTTATTATAC AACAACCATT TACAGGTGTA AAGTTGAGCA 
GTGTCTTTAG TAAATTTAGG GACTTGTTTT GTCATAGTTT TGATACCTGT AITCG^gS 
TCATGAGTAT TTGACGTTTC TTTTCATGTA GGCATTTCAA CAATTAGAAT GATATTTTTC 
CGAGGATGAA CTTATATATT ATGACTTTTT AGAGTTATGA TGCCTCTTGA ^Zlllc 
ZlZl?™ TCATCCACTr TT ^AGTTTC ATGCAACATA GCGAGAAGGC GGTTTCCTAT 
ATGGAAGTCG TCTTGTTTTC ATATTGTCTA AATACGAGAT ATTTACTACA GGTTGTAGAT 
ATTGCAAGCT CAAATGATAT TATTTTAGAG GAGGAGACAA GTGAAATAAT GCCTGGTGAG 
TTAGATGAAA TAAAAAGTAA GGAGTTGAAA GTACTGGTTC NTTGTGCAGG GTC^CA 

£££ aT™ — — -0 
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(2) INFORMATION FOR SEQ ID NO: 46: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 928 base pairs 

(B) TYPE: nucleic acid 

' (C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



CTGGGACTCT 


CTTCGTAGAA 


GTCATGCAAG 


AATATTTTGA 


TCAAAAGAGG 


AAATCATGAA 


60 


AAAAAGAGCA 


ATAGTGGCAG 


TCATTGTACT 


GCTTTTAATT 


GGGCTGGATC 


AGTTGGTCAA 


120 


ATCCTATATC 


GTCCAGCAGA 


TTCCACTGGG 


TGAAGTGCGC 


TCCTGGATTC 


CCAATTTCGT 


180 


TAGCTTGACC 


TACCTGCAAA 


ATCGAGGTGC 


AGCCTTTTCT 


ATCTTACAAG 


ATCAGCAGCT 


240 


GTTATTCGCT 


GTCATTACTC 


TGGTTGTCGT 


GATAGGTGCC 


ATTTGGTATT 


TACATAAACA 


300 


CATGGAGGAC 


TCATTCTGGA 


TGGTCTTGGG 


TTTGACTCTA 


ATAATCGCGG 


GTGGTCTTGG 


360 


AAACTTTATT 


GACAGGGTCA 


GTCAGGGCTT 


TGTTGTGGAT 


ATGTTCCATC 


TTGACTTTAT 


420 


CAACTTTGCA 


ATTTTCAATG 


TGGCAGATAA 


CTATCTGACG 


GTTGGAGTGA 


TTATTTTATT 


480 


GATTGCAATG 


CTAAAAGAGG 


AAATAAATGG 


AAATTAAAAT 


TGAAACTGGT 


GGTCTGCGTT 


540 


TGGATAAGGC 


TTTGTCAGAT 


TTGTCAGAAT 


TATCACGTAG 


TCTCGCGAAT 


GAACAAATTA 


600 


AATCAGGCCA 


GGTCTTGGTC 


AATGGTCAAG 


TCAAGAAAGC 


TAAATACACA 


GTCCAAGAGG 


660 


GTGATGTCGT 


CACTTACCAT 


GTGCCANAAC 


CAGAGGTATT 


AGAGTATGTG 


GCTGAGGATC 


720 


TTCCGCTAGA 


AATAGTCTAC 


CAAGATGAAG 


ATGTGGCTGT 


CCTTAACAAA 


CCTCAGGGAA 


780 


TGGTTGTGCA 


CCCGAGTGCT 


GGTCATACCA 


GTGGAACCCT 


AAGTAAATGC 


CCCCCATGTA 


840 


TCATATTAAG 


GACTTGTCGG 


GTATCAATGG 


GGTTCTGCCT 


CCANGGATTG 


TTCACCCGTA 


900 


TTGATAAGGA 


TACGTCACGT 


CTTCTCAT 








928 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

CTCTTTTACA GAGGTGTTTC CTATTTTGTG CTTCCATTTT TGGGGCAAGG AATTGTAGAT 60 
GGGGATGGCA ATCCTATCTT TTTATTGATT. ATGATATTCG TTTGCTTCAT AGTTTTAGTC 12 0 
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TTTTTGAAAT GGTTAGACTA TGATTTCACT AGATTGAGAA GGGAGTTTCT AGATACAGGT 180 

TTTCAAAAGT CTCTTACTAA GATTAACTGG GCAATGGGGG CTTATTATCT AGTGATGCAA 24 0 

AGTCTATCTT ACCTTGAATA TGAACAAGGT ATTCAATCAA CGACTGTTCG CCATCTCATC 300 

CTAGTGTTTT ACCTACTCTT TTTTATGGGG GGTATCAAGA AATTGGATAC C T ATT TG AAG 36 0 

GAAAAACTTC AGGAGGAACT GAACCAAGAG CAGACCTTGC GCTACAGAGA TATGGAACGC 42 0 

TATAGTCGGC ATATAGAGGA ACTTTACAAG GAAATTCGGA GTTTTCGCCA TGACTACACT 480 

AACCTCTTAA CCACTTACGT TTGGGCATTG AAGAGAAGGA TATGGAGCAG ATAAAAGAGA 54 0 
TGTACGACTC GGTCTTAAGG GATTCCAGTC AGAAATTGCA GGACAATAAA TATGACCTGG 
GCAGATTGGT GAATATTNGT GACCGTGCCC TCAAGAGTCT CCTAGCTGGA AAATTTATAA 
AAG 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 989 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



600 
660 
663 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 53 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

CTAGATTAAT CTGATATTTC TTTTTGGCAA ACCAAATGCT TCCTATCAGA AAAGCTAGCA 
ACAGCAACAT GGTAATAATG ATATGAATGG TCATTTTCTT CTCCTATTCT GCCTTTTCAA 
TATCTTTTTT CATCTCGTCA ACATTGAACT TAGCAAACAA GTATTGACGG TCTTGGACTG 180 
GGAAACGTTG GTCCAACTGG TCAACTGCTC CCACCTCGAT AATGCCTTCC TTGATGACAA 2 40 

AATCCATGAC ATGTCCACCG AAGGTCAAAT CATCTGATAT GAAGTGCAGA TGGTAGCCTG 
CCACACTGAC CCCATGGAAA ATCTCAGGCG TCCAGAAACC AACAATAGTC CCCGCAACAT 
TGTCACGACT ATATTCCGGT TGATGGGTTG CGACATCAGC AAACTTGGTA TCGGGTGTTG 
ACTTAGGAAT CATACGCACA TGCATATGTT AAAAATTCCC CCCCGAATCT TGATAGAGCG 
GAAAAGATTT TCCCCATCAT AATAAGACTC AATTCGTTCT TTCAATTCCT TGTCTGTCAT 
CTCAAAGCGC TGGCGAAAAA TGACCTCTGC CTGATGCGGT ACCACTGCAG CGTAAGGAAT 
AAGGGCATCT GGTGACACTT CCACAATTTC TGGCTGGTCT CCTGACCCTT TGG 



60 
120 



300 
360 
420 
480 
540 
600 
653 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



CTACGATATC 


TTTGGTCTTT 


TGTAAGATAT 


GAGGTCCACC 


CTT ATGCGC C 


X v. AVJ X X \J\JV. r\ 


fin 


TTTCATGCGA 


TTCAAGAAGT 


TGCCCCTCTT 


GATCAACCAA 


ACCATACTTG 

**- v — v — X /IV, X X w 


ATGTTGGTTP 

* \ x w X X ww X X V— 


i ^ n 


CACC1G AT ATC 


AATTGCAACG 


TAATATGTCA 

X **** X X X *wXX 


TAAATACCTC 


w x l x x rtvjn x l 




loU 


W X V_ X X WW X X 


TCACGAATCA 


AGGCAGCAGC 


CGCTTCTACA 


ACTGGACGAT 
nw x \jvjn^\jA x 


v_ x x w x X <wnvj x 


** \J 


CACTGGTGTC 


AATGGTGAAC 


GAACAGATCC 


AATATTCAAG 


CCTTCATTC5A 

^- V» X X * \ X X \J£\ 


X X X X LAnunL 




TTTTTTC3ATG 


ACACCGTACA 


TATTTCCATG 


AGCAGAAGTG 


AGTTTAfrAA 
nw x x inv.v./v% 


X wx\ X X vjUvj X X 


"\ fx r> 


GATAGCATAC 


TGCAATTCAC 


GCGCTGTTTC 


TAGGTCCTTA 


TCCGCAATTA 


rvv» x un x 1 sjr\kj 


/ion 


TTTCAAGAAG 


AGTTCTGGCA 


TAGCACCATA 


AGTACCACCG 


ATACCAGCPP 




a fl n 
f* o u 


GAGGCGTCCT 


CCTAGGAACT 


GCTCATCAGG 


ACCATTAAAG 


ACGATATGGT 


rTTfTPParr 

v.. x X 1 \_ v_ 


J ** u 


AAGGCTGACA 


AAGGTTTGGA 


TATCTTGAAC 


TGGCATAGAA 


GAGTTC TTCA 


CACCGATAAC 


600 


ACGAGGATTT 


TTCAACATTT 


CTGTGTAAAG 


GCTTGGAGTC 


AAAGC AACC C 


CTGCCAATTG 


660 


AGGAATGTTG 


TAAATCACGT 




TGGAGCTGCA 


GAACTGATAT 


CGTTCCAGTA 


720 


TTTGGCAACT 


GAGTTATTCT 


GGCAAGCGGA 


AATAAATTGG 


TGGAATCCGT 


TGCAATAGCA 


780 


TCTACTCCCA 


AGCTTTCAGC 


ATGGCGAGCA 


AGTTCCATAC 


TATCTTTAGT 


ATTATTGCAA 


840 


GCAACATGGG 


CAATAATGGT 


CAATTTACCT 


TTGGCTACCG 


CCATGACTTC 


TTCCAAAATC 


900 


AACTTGCGAT 


CTTCAACGCT 


TTGGTAGATA 


CATTCACCAG 


AAG AACC ATT 


GACATAAGAC 


960 


CTTGAACACC 


TTTATCAATG 


AAGTATTGA 
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(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1671 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 



TCAAAGAGTA 


ACAAAGGCAC 


CAAATTCTCG 


ATAGGAACGA 


TTTAGCACGG 


TAAACTTCAT 


60 


CCACTTGGGT 


TCACGGAACC 


AAACCAGCAA 


TAATTTCTTT 


GGGCACGGGT 


TAATAGCATT 


120 


TTGGTCAACT 


AGGAGTAGAT 


AGAACACATT 


TCNTTCTTCG 


TCTATATCAA 


TCTTAACACC 


180 


TGTTTCAGCG 


ATAATCTTGT 


CGATGGTTTC 


TCCACCCTTA 


CCGATGACAA 


TCTTAATCTT 


240 


GTCCACATCA 


ATCTTGATCG 


TATCAATTTT 


CGGAGC AGTT 


GGAGCCAATT 


CTGGACGAAC 


300 


TTCTGGAATG 


GTTGCTTCAA 


TGACATCAAG 


GATTTCAAAA 


CGCGCTTTCT 


TGGCTTGAGC 


360 


AAGAGCCTCC 


GTCAAGATTT 


CTGCAGTAAT 


CCCTTGAATC 


TTG ATATCC A 


TTTGAAGGGC 


420 


TGTAATC CC A 


TCACGAGTAC 


CTGCAACCTT 


GAAGTCCATA 


TCTCCAAAGT 


GATCTTCCAA 


480 


AC CTTGGATA 


TCTGTCAATA 


CTGTGTAGTT 


ATTTCCATCT 


GAGATAAGTC 


CCATAGCAAT 


540 


ACCAGCTACT 


GGCGCCTTGA 


TTGGCACACC 


AC CAGCC AT A 


AGGGCAAGAG 


TTCCCGCACA 


600 


GATAGAAGCT 


TGAGATGAAG 


AACCGTTTGA 


TTCCAAAACT 


TCTGCTACTA 


GACGGATAGC 


660 
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C A A fT* 
V- Anu vjO V_ J\ e \_ 


*7 -> /•> 

72 0 


1 A 1 1 Vj 


Ton 

780 


CATC AATfi A T 


o 4 U 


GCCCAcrs a<*:t 


?UU 


A A Ars/'s A/ry a 


9 60 


cr3 r rr , *i^/*^ a 


1020 


1 LAAA 11C 11 


1080 


1 1 kaAVj l ib 


1140 


U Av, I bTTGTA 


1200 


v.1 1L1 VjV— 111 


12 60 


1 sj/\k_ huC lie 


1320 


C A CC A f2 A Tr 


1 3 80 


ATTGCTCTGC 


1440 


GTACCCCAGC 


1500 


CAAACATAGC 


1560 


CTTGGACTTC 


1620 


G 


1671 



(2) INFORMATION FOR SEQ ID NO : 5 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 731 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

CTC AGTATC T TTGTAATCAA CATATCCAAT TTTGTTTGCT GCGATGTANT CAACTTTTTT 
ACGGCGTTTG AATCCGCCAC GACGTNGTTG AGCCATGTTT TTNCTCCTTT ATAAGTTTAG 
TTGTCCATTA GAATGGTAAA TCATCANCTG AAATATCCAA TGGGTTTGTT GCTCCAAATG 
GATTTTCATT ACGTGAAAAG TCTGGTACTG AATTTGTAGG TGCTGAATAG TTTGCAGTTG 
GTGCAGAGTA AGCTCCACCT GTGTGACCCT CACGCACACT ACGGCTTTCC AACATTTGGA 
AATTCTCAGC CACGACCTCT GTCACGTAGA CACGTTGTCC TTGCTGGTTA TCGTAACTAC 
GAGTCTGGAT ACGACCTGTC ACCCCGATAA GTGAGCCTTT TTTAGCCCAG TTAGCAAGAT 
TTTCAGCCTG TTGGCGCCAC ATAACGACAT TAATAAAATC AGCCTCACGT TCACCATTTT 
GACTCTTAAA TGTACGGTTT ACTGCAAGAG TAAAAGTCGC AACTGCTACA TTTGATGGGG 
TATAACGCAA CTCAGCGTCA CGTGTCATAC GCCCTACAAG TACAACATTG TTAATCATAG 
TTTACCTTCT TACGCGTCAA TTTTGACGAT CATGTGACGA AGAATGTCAG CGTTGATTT- 
GGAAAGACGG TCAAACTCTT TAGAGAGCTG CATCGNTCAT TTGCTTCAAC GTTAACGATG 
TGGTAAAGTC C 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
731 
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(2) INFORMATION FOR SEQ ID NO: 52: 



IPCT/US97/3D5306 



<i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 992 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : single 
. (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



CTGGGTTTTA 


CCACGTTTTC 


CTGATGTACG 


AGCGAGATCC 


TTACGGTTCA 


AL. AI AGTGTT 


6 0 


GATAAAGCTG 


GATTTACCAA 


CATTTGAACG 


CCCTGCTAGG 


GCAATCTCTG 


GCAGTTCATC 


120 


CTGCGGATAG 


TGGGACTTAT 


TAGCTGCACT 


GAGCAAGATT 


TCAGCATTGT 


GTGTATTAAG 


180 


TTCCATAGTC 


ACCTCTAGGC 


TGTTTCTAGG 


ATCGGTTTAT 


CCGTTCCATC 


GACAGTTTCT 


240 


TTAGTGATGC 


GAACCAATTT 


CACATTTTCC 


TG AC TCGGC A 


CCTCAAACAT 


AACATCTAGC 


300 


ATGGTTTC TT 


CGATGATGGA 


GCGAAGTCCA 


CGCGCCCCTG 


TCTTCCGTTC 


GATTGCTTTA 


360 


TTAGCAATCT 


CTTGAAGGGC 


TTCGTCGTCA 


AATTCCAACT 


CAACATCATC 


ATAAGAAAGC 


420 


AAGGTTTGGT 


ATTGTTTCAC 


CAAGGCATTT 


CTTGGCTCTT 


TCAAGATGCG 


AACCAAGTCC 


480 


ATCAACGGAC 


AATTGCTCAA 


GAGCCGCAAA 


AACAGGCAAG 


CGTCCAATCA 


ACTCAGGGAT 


540 


AATACCAAAT 


TTTTGAATGT 


CTTCAGCGAT 


GAATTCTTGC 


ATGTATGAGC 


TGTTTCTCTG 


600 


TCAATCGCCT 


TATTGTTTTG 


ACCAAATCCG 


ATGACTTTTT 


CACCCAGACG 


TTGTTTGACA 


660 


ATTTCTTCAA 


T AC CATC AAA 


AGCACCACCC' ACGATGAAGA GGATATTTTT 


TGTATCCACT 


720 


TGAATCATCT 


ACTTGTTGTG 


GATGTTTGCG 


TCCACCTTGA 


GGCGGCACGC 


TAGCAACAGT 


780 


TCCCTCAATA 


ATCTTGAGAA 


GGGCTTGTTG 


CACCCCTTCA 


CCAGAAACAT 


CACGTGTGAT 


840 


AGACACATTC 


TCACTCTTCT 


TGGCAATCTT 


GTCAATTTCA 


TCCACATAGA 


TAATGCCACG 


900 


CTCTGCACGT 


TCGATGTTAA 


AGTCAGCAAC 


CTGCAAGAGT 


TTGAGGAGTA 


TATTTTCCAC 


960 


ATCCTCACCC 


ACATAACCAG 


CCTCCGTCAG 


AG 






992 



(2) INFORMATION. FOR SEQ ID NO: 53: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1299 base pairs 
{ B) TYPE: nucleic acid 
<C) STRANDEDNES S : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

ATTTTCACAG GGCAGGATGC GGAACGACTA GCCCCATACT TTAACGGAAT TTTGGTAGGG 60 
ACAGCTCTTA TGCAGGCAGA AAATGTGGCC CAGAGAATCA AGGAGTTGCA GATTGACAAA 120 
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GGTTAAAATT TGTGGACTAT CGACCAAAGA AGCGGTGGAA ACAGCCGTTT CAGCAGGAGC 
CGACTATATC GGTTTTGTCT TTGCACCTAG TAAAAGACAG GTGACTTTAG AAGAGGCAGC 
TGAGTTGGCA AAGCTTATTC CTGCAGATGT GAAAAAGGTT GGAGTATTTG TTTCACCAAG 
TCGGGTAGAA CTGCTGGAAG CGATTGACAA AGTTGGCTTG GACTTGGTTC AAGTTCACGG 
TCAGGTGGCA GATGATTTAT TTGAGAATTT GCCTTGTGCC AGTATTCAGG CTGTGCAGGT 
AGATGGAAAT GGGCATGTCC CCAATTCTCA GGCAGATTAT CTACTCTTTG ATGCCCCTGT 
GGCAGGAAGT GGCCAGTCCT TTGATTGGGG TCAACTGGAT ACGACTGGAC TAGCACAGCC 
CTTCTTTATC GCAGGTGGCC TTAATGAAGA TAATGTAGTA AAAGCAATTC AACATTTTAC 
TCCCTATGCA GTAGATGTAT CGAGCGGAGT GGAGACAGAT GGACAAAAAG ATCATGAAAA 
GATTAGAAGA. TTTATAGAGA GGGTAAAGCA TGGCATATCA GGAACCAAAT AAAGATGGAT 
TTTACGGAAA ATTCGGCGGA CGTTTTGTCC CAGAAACATT GATGACAGCA GTTTTGGAGT 
TGGAGAAGGC CTACCGTGAA AGTCAGGCAG ACCCAAGTTT CCAAGAGGAA TTAAACCAAC 
TCTTGCGCCA GTACGTAGGA CGTGAAACTC CTCTTTACTA CGCAAAAAAC TTGACCCAGC 
ATATCGGCGG AGCCAAGATT TATCTCAAAC GGGAAGACCT TAACCATACA GGAGCCCACA 
AGATTAACAA TGCCTTAGGA CAAGTTTGGC TTGCCAAACG CATGGGTAAA AAGAAAATTA 
TCGCAGAAAC GGGTGCTGGT CAGCACGGTG TGGCAACTGC AACTGCTGCG GCCCTCTTTA 
ACATGGAATG TACCATCTAC ATGGGTGAGG AAGATGTCAA ACGCCAAGCC CTCAATGTGT 
TCCGTATGGA GCTTTTGGGA GCTAAGGTTG AAGCAGTGAC AGATGGTTCG CGCCTGCTCA 
AGGATGCGGT CAATGCAGCC C TTCGTTC AT GGGTGGCTAA TATCGACGAT ACCCACTATA 
TCCTTGGTTC TGCCTTGGGG CCTCATCCAT TTCCAGAAA 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1078 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1299 



(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:54 : 

CTGTTGAATC AACAACACGT ATCCAAGTAT CAGNATCAAA TTATGCAGGT AATCGTACAA 
TTGGAAATCA CCGTGGATGG TTCAATCCAA CAACAACTTC TGAAGGTTTT . GTTACATATA 
TTTATGCAGA TTAATTTACA GAGGGACTCG AATAGAGCCC TCTTTTCAGG TTTTACCGTG 
ACAATCCCTA TTAAAAATTA TATCAAAATA GCTTGAAAAT ATTGGAAAAG 
TGAAAATTGT CGTGTGAACG ATAATACTCA TTCTTGATGA ATTGTGAAGC AGTTGCCCTT 
GGGTCGTTTT GCGAGTTGAA GTCAAGAAGA GGAAAAAAAC AAAAAGGAGA AATACTCATG 
GCAGTAATrP CAATGAAACA ACTTCT^AG GCTGGTGTAC ACTTTGGTCA CCA^CTCGT 
CGCTGGAATC CTAAGATGGC TAAGTACATC TTTACTGAAC GTAACGGAAT CC^TA^ 
^ C ™rT C AMCW ~ k ATACGCTGAC CAAGCATACN ACTTCATGCG TGATGCAGCA 
GCTAACGATG CAGTTGTATT GTTCGTTGGT ACTAAGAAAA CAAGCAGCTG ATGCAGTTGC 
TGAAGAAGCA GTACGTTCAG GTCAATACTT CATCAACCAC CGTTGGTTGG GTGGAACTCT 

89 



60 
120 
180 
240 
300 
360 
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TACAAACTGG GGAACAATCC AAAAACGTAT CGCTCGTTTG AAAGAAATTA AACGTGGATG 72 0 

GAAGAAGATG GAACTTTCGA AGTTCTTCCT AAGAAAGAAG TTGCACTTCT TAACAAACAA 78 0 

CGTGCGCGTC TTGAAAAATT CTTGGGCGGT ATCCGAAGAT ATGCCTCGTA CCCCAGATGT 84 0 

GATGTACGTA GTTGACCCAC ATAAAGAGCA AATCGCATGT TAAAGAAGCT AAAAAATTGG 90 0 

GAATCCCAGT TGTAGCGATG GTTGACACCA AT ACTGATC C AGATGATATC GATGTAATCA 96 0 

TCCCAGCTAA CGATGACGCT ATCCGTGCTG TTAAATTGAT CACAGCTAAA TTGGCTGACG 102 0 

CTATTATCGA AGGACGTCAA GGTGAGGATG CAGTAGCCGT TTGAAGCAGA ATTTGCAG 107 8 

(2) INFORMATION FOR SEQ ID MO: 55: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 800 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 



CTATGAGGGA 


TATAAAGTGA 


GAGAGTTGGG 


CCAACGTTCC 


AAAAACGTCC 


TAAATTCTGC 


60 


CCATTGACAA 


AGGCAACCCC 


CTTACCAAAC 


TCAGACNAGT 


CTAGGTAAGT 


ATCTTTTGGC 


120 


TATTCGACTG 


TAAAGTCATA 


AGCGTAAAAG 


GCTGGTTGTC 


CTTGAGTCCA 


TCCTTTGGAA 


180 


AAATCAATTT 


TCTCAGGATT 


GTCTAGTGGG 


AGTGGATAGT 


GTTTCCAGTT 


TAGTAAGAAA 


240 


TGCAGATCCT 


TACAGACCCC 


TGTCCGAATT 


CCCTTACGTT 


GCGTATCCGC 


TAAGAACTTA 


300 


TGCCCATAGT 


TGACACGCCC 


CATATTTTCT 


ATCAAGATAT 


CT AAC C TAG A 


TAGCCCTTTC 


360 


TTTTTACCTT 


GATAAAAAAT 


ATCTTCCCCA 


ATCTCTGTCT 


GATATTGAGT 


TTTAACCCAC 


420 


TGACCATCGA 


CATACAGCTG 


GGCCCTATCT 


CGACCATCAA 


TGATACGAAG 


TCTTTCTTCT 


480 


TCTGCATCCC 


AGTTTGTTTC 


TGTTCGATAA 


AGTAGGTAGC 


CATAACTTTG 


TCCCAGCTCC 


540 


TCCATCTTTT 


GAGGATAGAG 


ACTTTCTACA 


GGG AC TTG AC 


AAGCTATCTA 


AGGTTTCAAA 


600 


CAAAGAAACT 


TTTTCAACTA 


GTGGAATAGC 


ATCCAACTCC 


ATACTCTCTT 


TGTAGAGTGG 


660 


TTCCAACTGC 


GGATACTCTG 


AAAAATGTGT 


TGCCATCATC 


TTCTTGACTG 


CAAGATATTT 


720 


AGCAGTTGGA 


TTTCCTTCTT 


CATCCAGAAG 


GGCATCGTAA 


TCATAAGACG 


TTAACTTGTG 


780 


GCAGGTCCAA 


ATTCCTCGAG 










800 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 769 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi ) 


SEQUENCE DESCRIPTION: 


oty ID NO: 


56 : 






CTCACGCAGT 


TCTGTTAC C A 


AAGACCGTTC 


A A A A A/^OA A H 


CTAGCAATAC 


TTATAACTTA 


60 


TTAGAAATTC 


CTAATTTTAC 


CTTGCTAAAC 


uAA I AL. AGGT 


C ACGATGC C A 


TCTGAGTCTG 


120 


TGATTTCCAA 


TTGGCTCGAT 


GTCATCCATT 


JL bATTGGTAC 


GTCAACTTCT 


TGTGC TCGTT 


180 


GGGCAATCGT 


TAACAGTTCT 


TCTTTATGTG 


LAACTTCGAT 


GACATAGTAG 


GCTAAACCTG 


240 


GTAAGCCTTG 


CTTACGCGGA 


AGCCAGACCT 


TTTCCTTCCC 


CATTCGTTAA 


CTGCTAAATG 


300 


ATGATGGTAA 


TCTCCAGCCG 


CAATCCAACT 


AL»C c aggc 


ACACTGAATT 


TATTCTCTAG 


360 


CCCTAACACC 


TTTTGATAAA 


ACTGGCTGGA 


LH 1 CCaACTA 


TCCTTGACGG 


AAAGATGAAT 


420 


ATGCCCCATT 


CTTGTACCTT 


CTGCCAGGAT 




ACTCTTTCCC 


CCAACTCATA 


480 


AATGTCCTGC 


GCCGCAAGAG 


CCTCCGTCAC 


TCCGATAATG 


CGTCCATCTT 


CTCGAATATC 


540 


CCATGTGGAA 


ACTGGCTTAT 


CTCGATAGAG 


TTCAATGCCA 


TTTCCCTCCA AATCTTCCAA 


600 


GTAAATAGCT 


TCACTGTAAC 


CATGGTCTGC 


ACCGCCGACA 


AGAGGAATTT 


GCAAATCTGT 


660 


CAGATGTTTC 


AAGACATCAG 


CCAAAGCCTT 


GCGTGTGGGC 


AAGAGAATGG 


CCAAATGGTA 


720 


AAGACCATAA 


TGTTCCCTCA 


CTTCTCCGCT 


CTTCTTGTGC 


TTGAATCAG 




769 



(2> INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 637 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
{Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 



TTACCCCAAG CCTGCAACTT CCCAAGCTGA TTGGTTGAAT GATTTGTCAT CAACACCACC 60 
AGTATCAGTG ACGATTGCTG CTTTTGTCTT CACATCAGAA GATGAAGCTG CGTTACGAGA 120 
AGAGCGGTTA CCACATGCAG CAAGTCCAAC TGCTGCCACT GCAACTAGGC CAAGACCTAG 180 
CCATTGTTTC TTGTTCATTA CTGAACCTCC TAAATAAGAT GTGCAACGAT GTTGCAAGTA 
TGGATTGATT GGTCACAAAG ACCCTTGCCA CTCAAAGAGC GACTCAGACT AATTTAAGTC 
TGTAAAAGAA TATGGAAGTA ACTCCCCGAC CGTCATCTCG AC CGTCGATT TATCTTTTGC 
GACTAAGGTC ACTTTTAAAT CTTGTTCAAA AAATTCGACC ATTACTTGGC GACAAGCACC 
ACATGGTGAA ATCGGTTTTT CAGTTTGACC ATAGACAATC AATTCTGAAA ATTCTCTTTG 480 
GCCTTCAGAT ATAGCCTTAA AAATAGCTGT TCTCTCACCG CAATTGGTCC AAAGGATAGC 540 
TAGCATTTTC GATATTCACT CCCGTGTTAA ACATTTCCGT CTTTGGCTAC TAAAACTGCT 
CCGATAGGAA AGTGAGAATA GGGGACATAG GCATGTT 



240 
300 
360 
420 



600 
637 



(2) INFORMATION FOR SEQ ID NO: 58 



(i) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 940 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 



CTGAGGATAA 


TGACATCTGC 


CANAGGAGCA 


AAACTAGCAA 


AATCATCNGT 


TGCACGGTCA 


60 


ATCATGCCTT 


CTTTCAAGGC 


GATATCTCTC 


GAAGCTTGAC 


TACGGATTAT 


AACCTAAAAT 


120 


TTCATAATCT 


GGATGATCGC 


GTTTGATACC 


AAGTGC C ATA 


GAGGCTCCAA 


TCAACCCAAG 


180 


ACCTGCGATA 


TAGATTGTTT 


TTGCCATAGG 


AACTCCTTAA 


TAGTTCTTTG 


TATAGTCTCG 


240 


GTGTTTGGCT 


ACCGCTTCTT 


TTAGTTCCTC 


AAGATTATCT 


GATGAGAATT 


TTTCGAGGAT 


300 


TTCTTGCGCC 


AGAACCGTTG 


CTACAACTGC 


TTCCATGACC 


ATTCCTGCAG 


CTGGAAGAGC 


360 


AGTCGGATCA 


CTTCTCTCCA 


CGGTTGCCTT 


GTAAGGTTCG 


TGGGTTTCGA 


TATCCACACT 


420 


CATAAGAGGT 


TTATAAAGAG 


TAGGAATGGG 


TTTCATGACC 


CCACGAACAA 


CGATGGGTTG 


480 


CCCATTAGTC 


ATACCACCTT 


CAAAACCACC 


TAGATTATTG 


GTACGGCGAG 


TATAACCGTC 


540 


TTCTTTAGAC 


CAGAGAATTT 


CATCCATAAC 


TTGGCTGCCT 


TTACGATAAC 


CAGCCTCAAA 


600 


GCCAAGACCA 


AATTCCACCC 


CTTTAAAGGC 


ATTGATAGAG 


ACAACAGCTT 


GAGCC AATCT 


660 


TGCATCCAAT 


TTTCTATCCC 


ATTGGGACAT 


AGGAACCAAG 


ACCAACTGGG 


AACGCCTCCG 


720 


ACGACTGTCT 


CCACAACCCC 


ACCGATGGTA 


TCACCATCAC 


GTTTGATTTG 


GTCAATATAG 


780 


TCCTTGATTT 


CCTGTTCTCG 


TTCTTGGTTG 


ACAATAGAAA 


ACTTCCAGAC 


TGGGCAGCTC 


840 


TTTGCTTTAA 


TTTCCAGCGA 


CTGTCAGATT 


TTCCAGGAAC 


ATCCATTCCC 


TTGCCACCAA 


900 


AG AC C ACGAC 


ATGGTTGGCA 


ATC TC CAT AT 


CCAGATTCGA 






940 



(2) INFORMATION FOR SEQ ID NO: 59: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 



GATTCTACAG 


GATCCATTTT 


AC TATCTTAC 


GCGCCGGGAA 


GTAGGCTGAG 


ACATAACCAA 


60 


GTAATAGAGC 


CAACAACTAG 


AGTTCCTAAA 


ACAGATAAAA 


GATTTAATTC 


AAAAACCTTA 


120 


GTGATGGATG 


GGTAAAAGTG 


ACTTACAATC 


GCATTCGCCA 


AACTTCCCAC 


CCCTTGTGCA 


180 


ACCAAAAATG 


CCAGCAGCAA 


GGCGATGCCT 


ACAATCCAGA 


TAGCCTCGTA 


AATAAAAATT 


240 


C CTTTG AC AT 


CACGATTCTG 


ATAACCAACT 


GCTTTCATGA 


CACCTATTTC 


CTTGGAACGT 


300 


TGCATGATAT 


TGATGTAAAT 


AATG AT ACC A 


ATCATAACCG 


CTGCTACCAC 


AATAGCTTGT 


360 
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GATGAAAGCA CAATCAATAA TCCCTGAATA ACACGAATAA AGGTAATCAC AATATCAAGA 420 

ACTCTCTGTT AAGAAAGCAC AGTATACTTC TTATTTTTCT GTAATTCTTC TGTTACTACT 4 80 

TTTGTCTGTG ATGGATCTTT GAGTTCCAAG ATAAAATAAG ATACAGCTTT CGTAAATCCA 540 

GCCTCTTTCA AAATCGTTTC CATTTGATGA GACAGCATGA AACTGTTGCT GTCCTCCATG 600 

TCATCTTCAT CATTGATTAC ACGTACAATC TTCGTTTGAA* ATTGAGCAAT CTTACTAGTT 660 

TCGGCAGCAC TTTCTACAAT GCTGACTGAG ACTGATTTGC CAATAAGATC ATTAGCTGTC 720 

AAATTTTTTC CTGTCTGTTC ATTCCAATTT TTTAGTAAAG CTGCTTGGAA TCGTTAATCC 780 

CTGTTCATTT GTATCAGTAT AGAGGGATCC AGCCAACACT TTGTCCGTCT CATTATTACT 840 

AACAGAGATA CTTGTATCAT CATAAAGACT CACTACTTGA GCATAAGAAG CATCGTTTGA 900 

CTCAAATCCA TTTCTTGCCC ATCTTTTCTT GCCCATCTAT AGTAATATTT GACATGTTCA 960 

TCCCNAAAGG ACTCTCCAAA TATNNNATAG g90 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


60: 






ATCGGCTAAG 


ATGCGTGGAG 


CATCACAAAT 


TATCCTTATG 


AGCCGTCATG 


AAGACGTCAA 


60 


AAGATGGCTA 


TGGAGTCAGG 


TGCGACAGCT 


GTTGTTGCAG 


AACGTGGTCA 


AGAACGAATT 


120 


ACCAAGGTGC 


GTGAAATCCT 


CGGCGGAGGA 


GCAGACGCAG 


CACTTGAATG 


TGTTGGTACG 


180 


GAGGCTGCTA 


TAGAACAGGC 


GCTAGGTGTT 


CTTCATAATG 


GAGGGCGTAT 


GGGCTTTGTA 


240 


GGAGTCCCAC 


ACTATAATAA 


TCGTGCTCTT 


GGTTCGACAT 


TTATGCAAAA 


TATCTCTGTA 


300 


GCAGGTGGGG 


CAGCTTCTGC 


TACAACATAC 


GATAAGCAAT 


TTTTACTAAA 


AGCCGTCCTT 


360 


GATGGTGATA 


TCAATCCAGG 


TCGCGTCTTT 


ACTTCAAGTT 


AT AAAC TGG A 


AGATATCGAC 


420 


CAAGCCTACA 


AAGATATGGA 


TGAACGTAAG 


ACCATTAAGT 


CTATGATTGT 


AATCGAATAA 


480 


AAAACGAATA 


GGAGTTTTAG 


AACTCTATTC 


GTTTTTTATG 


TTATCCTATT 


CTTGATTTAA 


540 


GGTACTTTCT 


CTTAATGTCA 


GTCTGGTTCC 


CAGCATGGTC 


AGGCTAGGGA 


TTTTCCGACC 


600 


GTGGAGG AC T 


TCCTTGTTAA 


GAATATCCAT 


ACCTGCTCGG 


CCCATTTCTT 


CAGTATAAAC 


660 


TGTAATACTA 


AGAGAGGGGA 


GGATAGACCT 


GTTTGGTCAG 


ACTAAGTGTC 


GTTAAAAGGA 


720 


AAATGAGGCT 


GACGCGATCT 


GGCAGGCTGA 


TTCCAGCTTC 


TTGGAGGGCA 


CGGAGGGCAC 


780 


CGATAGCTAA 


ACTATCGCTG 


GCTGCGAAAA 


ATGCTGGCGG 


AAGTTGGTCT 


CCCAAGCTCT 


840 


G AATGGCC TC 


CTTCATTAAG 


TCATAGCCAG ACTGGGCAGT 1 AAATCTTCCT 


TGAAAGACCA 


900 


GTTCATCATG 


ATAGATTCCC 


CTCGCTTGAC 


TGTAGTTTTT 


GAAGTTTTCT 


AGACGCTTGT 


960 


CCTGAATGAT 


TTCTTCTTGG 


TCTGTAGTTT 


CTTCAAGGCC 


TGTTAGAATC 


CCGATACGGT 


1020 


CCATTCCTTG 


ACTGAGGAAA 


TAATCGACAA 


C CTGTTTC AT 


AGCAGTGTAA 


AAATCCGTGA 


1080 


TAATACAGGT 


ATGTCCCAGG 


GAAAGTGTAT 


CGCTGTCTAG 


AAATACAAGA 


GGCTTTTGGT 


1140 


ATTCTTCAAA 


GGCAGAAATC 


TGAGCTTCTT 


GATAGGATTG 


ATTCTGCTAG 


GGATAACAGC 


1200 
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CAAAGGCAAT CCTCCCTTTA TCCACCTCAT ATTTGAAACC ATTTCAGCTC TTA 125 3 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1211 base pairs 
• (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



CGTGTACCAT 


CAAGAGGCTA 


ATTGATGTTT 


CTTCCCCCTG 


TGCCTCTCTT 


GTCAACATTT 


60 


CTGTTTGAGC 


CTTATGCGCC 


TGTAATAAGG 


CAGCATTCGC 


ATCAGCTAAT 


CTTCGATTGG 


120 


CTTTAGAGAA 


ATCCCCTTTT 


TTTGCTGCCT 


GAATAGCCTC 


CATAGCATTA 


CTTTTGGCTT 


180 


CCCCACCATA 


CATAATTAGC 


CCCATTACAG 


ATTCTAAATT 


ACTTTCATCC 


ATAACATTCT 


240 


CCTCAATCTT 


AGTCTAATAA 


ACTTTCAGCT 


AAATCTAAGA 


CTTTAGAACC 


ATTCATCATT 


300 


CCATAATCTG 


TCATCGGAAT 


AACCGCCACA 


GGAATCTGTC 


TATCTTTTAG 


TTTTTC TTGA 


360 


AAGTCCCCTA 


GTAAATAGCG 


AACTTGAGGA 


CCTAAAAGCA 


ACACATTTAC 


TTC TTTTGTT 


420 


GCTACAATTT 


CTTCTGCTTC 


AGGAGCTGGA 


ACTGCAAAAA 


TTTCTGCATC 


CAACCCCTTA 


480 


TCTTCTGCTG 


CCTTTTGCAT 


CTTTGTCACT 


AGCAGACTTG 


TACTCATACC 


TGCTGCACAT 


540 


GCTAACATAA 


TTGTTACTTT 


AGCCATAGCT 


TACTCCTTTA 


TCACCTATTT 


ACTGACCTAG 


600 


CCAAGCGAGA 


TACTGTATGT 


CTCACTCCTC 


TAATTGCCTT 


GCTCAATGCA 


TAAATATTAT 


660 


CGACTGTTGC 


CAGTCCCCCA 


TATCCTGCGT 


CACCTATATG 


TTGAATATCA 


ACTCCACAAA 


720 


TTTTATTTCT 


AAGTGCAATT 


TCCTTAATAG 


TATCTGTATC 


AGATGTTTCT 


TGGCTAGTAC 


780 


CAATAGCACT 


CAATACAAGA 


CCACCCTTAC 


TATGAACGAG 


ATCAACGACT 


TCACGCAACT 


840 


CTTGGTCATG 


AAAGGCTGGA 


ACGGTTCCAA 


CTGCTGGAAC 


AAGTATCACA 


TCCGCACCTG 


900 


CTTCCAACAA 


TTGCTCTGCG 


ACAGAAAGCT 


CTGCCACAGG 


CTCATTCACT 


CCTGCCCCGT 


960 


GCATCTTACC 


TGCAATAATC 


AAACCAGAAA 


AGTTTTCCTT 


AGCAGTTTGA 


ACAGCCTTAA 


1020 


TGATTTCTCG 


ATTGCTAACT 


CCTGTTCCAG 


GATTTCCAGT 


CAAACAGACA 


AAGTCAAAAC 


1080 


CTAACTCCTA 


TATTCGCTTC 


AATGTTTCAA 


CACTAGCAAC 


ACGACCTGCA 


ACAATTTCCT 


1140 


GTGTTTCCTC 


TAGCATTCTT 


TGCAGATGGG 


TCAATCGGTT 


CCAAATTTAC 


ACCAATTGGA 


1200 


CAAGCAACAA 


G 










1211 



(2) INFORMATION FOR SEQ ID NO: 62: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/US97/U5306 



.(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

CCACACTAGC AGCCTATGGA CTCACTAAAA GAAAAGAAGA CTAAGTCTTT TCGATAAAAA 
ATAAACAGCG AGATTGAAGC TCGC TGTTTA TTTTTTAATT AATCACCTAG TCCAAGACGT 
TCAAAGATAT CATCCACTCG TTTGGTGTAA TAAACTGGGT TGAAGATTTC ATCGATTTCT 
TCTTGTGTGA GACGTGATGT TACTTCTGAA TCTGCCTCAA GAAGTGGTTT AAAGTCTACT 
TGGTTGTCCC AAGAGTAGGC TGTTTTTGGT TGCACCAAGT CATAGGCTTG CTCACGGGTC 
ATGCCTTTTT- CAATCAATGT CAACATAGCC CGTTGGCTAA AGATAAGACC AAAAGTCGAG 
TTCATGTTTC GGATCATATT TTCTGGGAAG ACTGTCAAGT TCTTGACGAT ATTTC C AAAA 
CGGTTGAGCA TGTAGTCAAT CAAAATGGTC GTATCTGGTG TGATGATACG CTCAGCTGAT 
GAGTGAGAAA TATCGCGTTC GTGCCAGAGA GCGACGTTTT CATAAGCCGT AATCATGTGA 
CCACGAATGA CACGCGCCAG ACCAGTCATA TTTTCAGAAC CGATTGGGTT GCGTTTGTGA 
GGCATTGCTG AAGACCCTTT TTGCCCTTTA GCAAAGAACT CTTCTACTTC GCGTTGCTCA 
GATTTTTGTA GACCACGAAT CTCAGTCGCC ATACGTTCGA TTGAAGTCGC AATGCTGGCA 
AGAACCGCAA AGTACTCAGC GTGAAGGTCA CGAGGAAGGA CTTGTGTTAA AGATTCCTTG 
GGCACGGATG CCAAGTTTAT CGCAGACATA CTCCTCTACA AATGGTGGGA TATTGGCAAA 
GTTCCCAACC GCACCAGAAA TCTTACCAGC TTCTACACCA GCAGCCGCAT GC"CGAAGCG 
CTCCGATATT GCGTTTCATT TCGCTGTACC AAGTTGCTAA TTTAAGGACC AAAGGTTGTC 
GGCTCAGCGT GCACACCATG AGTTACGCCC CATCATGATG GTGAACTTGT GCTCCTTGGC 
TTGTCAGCGA TGATATTAGT GAAGTTTTCA AGGTCACGAC GGATGATGTC GTTGGCCTGC 
TTGTAGAGGT AACCATAAGC AGTATCCACC ACGTCGGTAG AAGTTAACCC ATAGTGAACC 
CACTTGCGCT CTTCACCAAG AGTCTCAGAA ACCGCACGCG TGAAAGCCAC CACATCGTGG 
CGCGTCTCCT GCTCAATTTC CAAAATACGG TCGATGTCAA AGTCCGCCTT CTTGCGAATC 
AAAGCCACAT CTTCCTTAGG GATTTCCCCA ACTCAGCCCA TGCCTCGTCA GAGAGGATTT 
CCACCTCAAG CCAAGCACGG TATTTATTTT CTTCACTCCA AATATTCGCC ATCTCAGGGC 
GAGAGTAACG GTTGATCATG TGTTAATTTT TCCTTTCTTC TTAAG 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1124 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

CTATTCTGAG TATAAAGTTG GCGGAGGCTG GAACTACGCT CGTTATGAGG TCATAAACTA 
CTATACTGGA GGTTATTAAT CCTTAAAGAG TGAGAAAAGG AGGGCTGGAT ATGTTAAATC 
TTACTCATGT TACCTTAAAA ACGCGACAAG TCATCTTGCA AGATGCGGAT TTTACCTTTA 

95 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1425 



60 
120 
180 
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AAAAGGGTAG 


GATTTATGGC 


CTTCTTGCTA 


TCAATGGCTC 


GGGAAAGACG 


ACACTATTCC 


240 


GAGCTATGAG 


C AAGTTGC TT 


CCCCTTAGTA 


GTGGACACAT 


CGCAGTTCCT 


CCTTCTTTGT 


300 


TTTATTATGA 


GAGCGTTGAA 


TGGCTGGATG 


GAAACTTAAG 


TGGGATGGAC 


TACCTTCGTC 


360 


TCATAAAAAA 


ACATCTGGAA 


GTCAGACCTA 


AACTTGAGAG 


ATGAAATCGC 


CTACTGGGAA 


420 


ATGGCTGACT 


ATATCAGTCT 


TCCCATCCGC 


AAGTATTCCT 


TAGGGATGAA 


GCAACGCTTG 


480 


GTGATTGCTA 


TGTATTTTCT 


CAGTCAGGCC 


AAATGC TGGC 


TCATGGATGA 


GATTACAAAT 


540 


GGCTTAGACG 


AGTATTATAG 


ACAGAAGTTT 


TTTGATAGGC 


TAGCACAAAT 


CGATAGACAA 


600 


GAACAGCTGG 


TTCTTTTAAG 


TTCCCACTAT 


AAGGAAGAGT 


TGGTTGATAT 


CTGCGATAGA 


660 


GTAGTAACCA 


TTCATCAGGG 


GCAGATAGAA 


GAGGTTTAGT 


TTATGAAAGA 


TGTTAGTCTA 


720 


TTTTTATTGA 


AAAAAGTTTT 


CAAAAGTCGT 


TTAAACTGGA 


TTATCTTACT 


TTTATTTGCA 


780 


TCTGTACTCG 


GTGTTACCTT 


TTATTTAAAT 


AGTCAGACTG 


CAAACTCAGT 


CAGCTTGGAG 


840 


AGAGAGTTGG 


AAACTCGTCT 


TGTAGACCGT 


GAGAGAGTCA 


TCAATGAAAA 


TGAAGAGAAA 


900 


CTCTCCCAAA 


TGTCTGATAC 


CAGCTCGGAG 


GAATACCAGT 


TTGCTAAAAA 


TAATTTAGAC 


960 


GTGCAAAAAA 


ATCTTTTGAC 


GCGAAAGACA 


GAAATTCTGA 


CTTTATTAAA 


AGAAGGGCGC 


1020 


TGGAAAGAGG 


CCTACTATCT 


GCAGTGGCAA 


GATGAAGAGA 


AGAATTATGA 


ATTTGTATCA 


1080 


AATGACCCGA 


CTGCTAGCTC 


TGGCTTAAAA 


ATGGGGGTTG 


ACCG 
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(2) INFORMATION FOR SEQ ID NO: 64 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1089 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 



CAGAGCAACT 


TCGTCAAATC 


TTCATAGGTG 


AGGTAACCAA 


TTGGAAAGAG 


GTTGGTGGTA 


60 


AGGACTNACC 


CATCTCTGTT 


ATCAATCGGG 


CAGCCGGCTC 


TGGCTCTCGT 


GCT AC CTTTG 


120 


ATACTGTCAT 


TATGGAAGGT 


CAGTCTGCCA 


TGCAAAGTCA 


GGAGCAGGAT 


TCAAATGGAG 


180 


CGGTAAAATC 


AATCGTATCA 


AACAAGTCCA 


GGAGCTATCT 


CTTATTTATC 


TCTTACCTAT 


240 


ATAGATGATT 


CGGTCAAAAG 


CATGAAGTTG 


AATGGCTATG 


ACTTAAGTCC 


AGAAAATATA 


300 


AGTAGCAATA 


ATTGACCCTT 


GTGGTCTTAT 


GAGCATATGT 


ATACATTGGG 


GCAGCCCAAT 


360 


GAGTTGGCTG 


CAGAATTTCT 


CAATTTTGTT 


CTCTCGGATG 


AGACCCAAGA 


AGGGATTGTC 


420 


AAAGGATTGA 


AGTATATTCC 


GATTAAGGAA 


ATGAAGGTTG 


AAAAAGATGC 


TGCCGGAACT 


480 


GTGACAGTGT 


TGGAAGGGAG 


ACAATAATGA 


ATCAAGAAGA 


ATTAGCTAAG 


AAAATGTTGC 


540 


TTCCATCAAA 


GAATTCTCGT 


CTGGAGAAAT 


TAGGAAAAGG 


TTTGACCTTT 


GCCTGTCTTT 


600 


CTTTGATAGT 


CATCCTTGTG 


GCCATGATTT 


TGGTTTTCGT 


AGCGCAAAAA 


GGCTTGTCGA 


660 


CCTTCTTTGT 


CAATGGTGTG 


AATATCTTTG 


ACTTTCTTTT 


GGGAGGAACT 


TGGAATCCTT 


720 


CTAGTAAAGA 


ATTTGGTGCC 


CTTCCTATGA 


TTTTGGGTTC 


CTTTATCGTT 


ACCATTCTCT 


780 


CAGCCCTTAT 


CGCAACACCC 


TTTGCTATTG 


GTGCAGCAGT 


TTTTATGACC 


GAAGTATCAC 


840 


CAAAAGGGGC 


GAAGATTTTG 


CAACCAGCTA 


TTGAACTCCT 


GGTTGGGATT 


CCTTCAGTAG 


900 
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TGTACGGATT TATTGGCTTG CAAGTCGTCG TTCCCTTTGT TCGCAGTGTC TTTGGTGGGA 9 60 

CTGGTTTTGG GATTTTGTCA GGGATTTCCG TCCTCTTTGT CATGATTTTG CCGACCGTAA 1020 

CCTTTATGAC AACGGATAGC TTGCGTGCGG TTCCTCCNTT ATTATCGTGA AGCCAGTTTC 10 80 
GCTATGGGA 



(2) INFORMATION FOR SEQ ID MO: 65 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1502 base pairs 

(B) . TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



1089 



60 
120 
180 
240 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

CTNTNTTTAA AC CTTGACCT TANTTGANNA TTATACCGAA ATTNTCTTCA TTTTTAAAAG 
AAAAAAGGGC GCTGGTAAAA GGGATAATCT TCACCAACTC CCTATNTTTC TACTTATCTA 
AGCCTAATTC TGCCAAGATT TGACGTTTGT NGGCAATCTT TNGGACTTCN TNGTCCTCAT 
CTTCAGACCA GTCTAGTTTA ATTTCTGAAA CAATCTGCCC AGGGNGATTT NTCAAGATAT 
AGATGCGGTC GCTGAGANTG AGGGCCTCCT CAATACTATG CGTGANGATC AGGGTTGTTA 300 
GCTGCAACTG C TTGTGAATC TCAAGATACC AAGCGTGGAG TTCCATCTTT GTCATCTCAT 3 60 

CCAAGGCGCT AAAGGCCTCA TC TAAGAGAA AGAGCTTGTG CCCAAAAAGG TAGGTCCGGA 
GT AAGGC T AC ACGCTGGCGC ATCCCACCGC TAAGTTCATG AGGATACTTG TCTCTTACAG 
CTGTCAGCTG GAAGGTCGCA AGAATTTTAT CCGCTCGGGA AATAGCTTCT GCCTTATCCA 
CCTTTTGAAT CAAGAGGGGC AGAATGATAT TTCCAAGCAC CGTCTTGTGC TCCAAGAGCA 
GATCCTTTTG CAACATATAA CTCACGTGCC CCTTGGGATT TTCTTCACCA TCAAGGACAA 
TTCTCCCTGA CTGAACTTCT AAAATCC C AG CGATTAGATT AAAGAGGGTG GTCTTTCCAA 
CACCACTTGG GCCTAGGATG GAAACCACTT CGCCTGAAGT CACCTGTAGG TTGATATCCT 
CTAAAATCCT CTCCTGACCA TAGGCATAAC TGACGTGCTC TAGTCTAATT TCTGTCATTA 
TTTCACAAAT TCGTTGGTGA AGCCTTTGTC TGTCAAGTCT . TCTTTAAGG A TACCATTTTC 
TTTATCCCAT TTATAGAAAG CATTCCAGCG AGCTGCGTCA AATTGACCCC ATTTTTCCTT 
GTCGCTTGCG TATTCTTTTG ACAAGTATTT TTGAGATTCG ATGACAAAGT CACGTTTTTC 
CTTGAGTTCA GGTGCATTCT TGATGAGAAT ATCTGCAGCT TCTNCTGGAT GTTCCATGGC 
ATATTGGTAG CCTTTTNTGA TGGCTTGGAT GACTTTGCGA GCTTCTTCTT -GTTATCTTT 
CAGATAGTCG TTGTTTGCGA TGATAACTGG TGAATATTAN TCAAACTCCT TGACATAGTC 
TTTCAANTAC ATGAAGTTAG CATCTACACC TTGAGATTTA GCAAGGATAC CATCCCAACC 
GTAGTAAATC CNAGCAGTAT CAAAGACGCC ATTGGCAATC GGTGTGATTG AGTTTGAGTC 
NTTANTTGGT ACTNTTTCAA CCTTCTCACA GTCTCCNCCT TGAGATTCTA C C AAGGTTTT 
CAACATACNN GTTCANTTGG GTCATTCCAT GTCCCATATT TCTTACCAAC CAAGTCTTTT 
GGACTGCCTA CATTNTCACA TTACCGAGAG ATGATCCCTG ATGTATTGTG TTCCACCNAT 



AG 



420 
480 
540 
600 
660 
72C 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1502 
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(2) INFORMATION FOR SEQ ID NO: 66: 



PCT/US97/0SM6 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1022 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 



ACTCTTCAGC 


GATTTCTTGG 


CTGAGAATGT 


CCTCATAGGC 


AGGGAGTTGG 


ATGTCCTGAC 


60 


CATATTTCTT 


CTTGAGGGCA 


GTAGTAACTA 


GGCGATAAGC 


CAGATTGGCA 


TTACGAGAGA 


120 


AG AT AGGCC C 


GNGGAANTAC 


GAACCAAAGA 


CATTCTTATA 


ATGAACCCCT 


TCACCGACCT 


180 


TTTCTTCGTT 


GTTTCCATTT 


C CAT AG AC AA 


CCTGTCCCAG 


CGGTTTTTGG 


TCATCAGAGA 


240 


GG AAGGTACG 


ACCTTGGTGA 


TTTTCAAATC 


CATAGTAGGT 


TTCATCGAAA 


TCTTTCATTG 


300 


TGAATCTTGA 


TGTCACCGAT 


AAAACGGTTA 


TTGGTCTGGT 


TGAGCGTGTA 


GTGTCCCATG 


360 


ACCCCTAGCC 


CTTCGATACG 


TTTTCCTGAA 


GCTTCAACAT 


AATATTGACC 


CAATAGTTGG 


420 


AAACCACCGC 


AG AT AGC C AG 


AACTACACCG 


TCGTTTTGGA 


TGTAGTTGTC 


AATGCTCTCT 


480 


TTTTTAGCAG 


GTAGGTCGTC 


TGCAATGATA 


CTTTGTTCAA 


AGTCTTGACC 


ACCACCGAAA 


540 


AAGGCGATGT 


CGTAGTGATT 


TTCATCAAAG 


TCATCATGGA 


GAGAAACGAT 


GTCAACGGTC 


600 


ACATGGGTTC 


CCAGTTTTTC 


AGCCACATAC 


TTGAGCATGA 


GG ATGTTTC C 


ATTGTCCCCG 


660 


TAGGTATTCA 


TGAGATTTCC 


GTAGAGGTGG 


GCAATGTTGA 


GCTGATAGGG 


GTAATTGCCA 


72C 


TCTTTTGAGG 


AAAGTGAAGT 


ATAAACCATT 


AGTTCATCTC 


CTTTCTAACA 


ATCTGACGAC 


780 


TAGCCAGCAG 


TTCACGAAAT 


TC C AGCATGG 


CAGTATAAGT 


TGCCAGAATA 


TAGGCATGCT 


840 


TGCAGTCTTG 


ATTCTCAATG 


GTCTTGAGAA 


CTTGCTCCAG 


ATTACTCGTT 


TCAGTGATTT 


900 


TCTCAGCTGG 


ATAGCCAGTC 


ACTCGGAGGC 


GACGAGCGAT 


TTCAGAATGA 


CGAACACCGC 


960 


CAGCGTTGAT 


TTCAGGAATG 


TCCATGTCAG 


TGATTTGCTC 


AAAGTCTGCA 


TCCCAGATCC 


1020 


AG 












1022 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
CCTCCAGGTC TCGGGAAACG ACCATGGCCT TTGTTATTGC CAACGAACTG GGAGTCAATC 60 

98 
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TTAAGCAGAC GTCGGGTCCA GTCATTGAAA AAGCCGGAGA TCTGGTAGCT ATTTTGAATG 
AGTTAGATCC TGGGGATGTA CTTTTTATTG ATGAGATCCA TCGTTTGCCA ATGTCAGTGG 
AAGAGGTGCT TTATAGTGCT ATGGAGGACT TCTACATAGA TATTATGATT GGGGCTGGTG 
AGGGTAGTCG TAGTGTTCAT TTGGAGTTAC CACCTTTTAC CTTGATTGGT GCGACGACTC 
GGGCTGGTAT GCTCTCCAAT CCGCTACGGG CACGTTTTGG GATTACAGGC CATATGGAGT 
ATTATGCCCA TGCTGACTTG ACAGAAATTG TCGAGCGGAC GGCAGATATT TTTGAGATGG 
AAATCACTCA TGAGGCAGCA TCTGAGTTGG CCCTACGTAG TCGTGGGACC CCTCGTATTG 
CCAATCGTCT CCTCAAGCGC GTGCGCGATT TTGCCCAGAT AATGGGGAAT GGGGTAATTG 
ATGATATTAT TACCGATAAG GCTTTGACTA TGCTGGATGT TGACCATGAA GGTTTGGACT 
ATGTGGATCA AAAAATCCTT CGTACCATGA TTGAGATGTA CAGTGGAGGA CCTGTTGGTC 
TAGGAACTCT TTCTGTTGAA TATCGCCGAA GAACGTGAGA CAGTTGAAGA TATGTATGAG 
CCTTACTTTG ATTCAAAAAG GTTTTATCCA TGCGGACACC GTCCTGGACG GGTGGCGACT 
GCCTAAGGCA TATGAGCACT TAGGTTATGA AT AC AG TG AA AAATAAGCAA GAAATCCTTA 
GAGGCTTTTT AGAGAAAATC CAGATATGAT GGCCATTCTG ACGATCATCC GAGACCTTGG 
TCTGAAAGAC TCGTGGTTGG CAGCAGGTTC TGTCAGAAAT TTCATTTGGA ATCTTTTGTC 
AGACAAATCC CCTTTTGATC ATGAAACAGA TATAGATGTT GATTTTC TTT GATCCCAGAT 
TTTTCTTATG AGGAAACCTT ATTACTGGAG AAAAAGCTGT GAGAGGATTT TCCTCAGTAC 
CAGTGGGAAT TGAAAAATCA GGTCTATATG CATCAGCACA GTCCTCACAC TGCTTCCTAT 
ACCAGTTCTC GTGATGCTAT GAGTAAGTAT CCAGAACGGT GTACGACAGT TGGACTGCGC 
TTGAATGAAG AATCAGATTT TGAAGTCTAT GCACCTTATG GTTTGGAGGA TATTTTGAAT 
TTCAAGTTCG TCCAACTCCT CATTTCTTAG AAAATGAAGA CCGAATGGAA CTCTATCAAA 
CACGTTTATC CAAGAAAAAT TGGCAGGAGA AAT 

(2) INFORMAT I ON FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: " 

(A) LENGTH: 560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

AC^^ TTCTGGCAAG ™*<^T GAATGTTGAC AATCCG AC CT TCATAAGCCG 
ACAATAAGGT TGGTCCAACG ATTTTCATGT AGCCTGCTAG GCAAACCAAG TCAATCTGGT 
GTTCTTCCAA GAGTTCGACA AGGGCTGCTT CGTAGTCTGT CTTGCTCTCA AACTCCTTGA 
GTTCAAAAGC ATAGGACAGA ACGCCGAGCT GCTTTGCACG CTCAAGCACA TAGGCATCAC 
GATGGTCTGA AAAGACAAAC TCCACTGGAA ATTCTTCGGC AATCACCTGA AAATTTGAGC 
CATTACCAGA GGCAAAAACC GCTATTTTTT TCATTTGATA ATGACACTTT CGTTTTCTTT 
CTTGACGATG CGACCAArTT CATAGACTGC TTCATCCAAC AATTCTTTTA CACGCTCTAC 
ATTTTCAGGG CTGACCGCCA ACATAAGTCC CACACCCATA TTGAAGATTT CAAACATTTC 
TTCGTGTTTA ATCTGACCAT ATTTTTCAAG GGCTTTGAAA ATTGGAAGCA CTGGAACTTT 
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GCTTTCATCA ATTTC AGCAG 560 



(2) INFORMATION FOR SEQ ID NO: 69: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 860 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



CTTCAACACT 


TTATTTTTAT 


CTTTCTCCCA 


TTTCGTATGC 


TAGTTCGGAA 


ATTGTTGCTA 


60 


GAACCTCTCC 


GACTATTTGG 


GATGTTCTCA 


TTGCTTTTGT 


AGGAGGGATA 


GCAGGTATTA 


120 


TTGGTGCTAG 


GAAAAAAGAG 


ACCAATAATA 


TTGTTCCTGG 


TGTTGCTATT 


GCAACCGCCT 


180 


TGATGCCTCC 


TCTTTGTACA 


GTAGGTTATG 


CTATTGCTTC 


TGCTAATCTA 


AAATTTATCA 


240 


TTAGGCTCCT 


CTTACCTATT 


CCTCATCAAT 


TGTAGCTTTA 


TTGTCATTGC 


GACTTATATA 


3C0 


GGTGTTAGGT 


TGATGATGGT 


TAAGAAACAT 


TATTTTAAAG 


ATAATGAAGA 


AGACTCTAAA 


360 


ATGCGTAGGA 


TTTTGCTTCT 


AGTTGCTGTT 


TTGCTGATGA 


TTCCGAGTTT 


CATCTCTGCA 


420 


ACGACTTTAG 


TGAGAGAAAC 


GTTGAAAAAA 


GAGTCCCTTA 


AGAAATTTAT 


ATCAGAGCAG 


480 


TTTCAGGGGC 


ATAATATTTT 


GAAAAAAACC 


TATTCTAAAA 


AGACTCATAC 


CCTAAAGCTA 


540 


ACCATTTCAG 


GAAATTATTT 


GACAGAAGAA 


GAACTCGATA 


TGATTTCAAG 


TAAGAGAGGT 


600 


GACTATGGTT 


TAAGTGATGT 


TTCTGTTCAA 


GTTTCACAAT 


TGTCTGATTC 


AGAACAACTT 


660 


AGTAAGGAAG 


AACTGGTGGA 


GTATTTCTTC 


CAGTATATCA 


AGGATAAGGA 


AGCAAAAGAA 


720 


AAGGAAAAAG 


CTAATAAGTT 


TTATACAGAG 


TCTGAGGAGC 


AATAATTTCT 


TGAGAATAGC 


780 


TGGTTTTTCT 


CGTGAGTCTT 


CTATGTATAT 


CAAAGGAAGA 


CTGAGGTTTT 


AAGTATGAAA 


840 


CTTTTCTTCT 


ATTATAGTAG 










860 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 925 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

TCTTGGCCAA CTGCATGGAG TTCAGCGGTC AATTTCAACG CACCTGAGAA ACAGACCCCT 60 
GCACCCCTGA AATCTCAGGA GACATGATGG TCTGGATGGA ATCAATAATG AG AAAGTC TG 12 0 
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ATAGAGATAA 


180 


GGCAGACTCC 


240 


TTGTAGGAGA 


300 


CTGGTACAAC 


360 


TGACATTGAT 


420 


ACACACGCGC 


480 


CACAGTTGGG 


540 


ATGTCGCTTT 


600 


ACTTGGCAAA 


660 


ACAACTACCA 


720 


ATTTCCGTAG 


780 


AAAAACAGTC 


840 


TGTAATAAAG 


900 




925 
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GCTCTACTCT TAAAGGAAGA CCCGT 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE : Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

TTC TNG ATC A AGTCAAGGAC TGCANGCCAC GACCGACATA GCGTTTGGCA ATGGAAACAA 
CCAAACGAAG ATTGGCTTCC GCAAGACGTT GTTTGGCTTC GATATCGCCA GCTTCAACAG 
CCAGTGCCAA CTCTTTCTCC TCTTCATTGG TCAAGAGAGG AACGACCCCT ATTTCTTTCA 
AGTACATACG GACAGGGTCA TTGACCTTAG CAGAAGTTGA CCCAATCAAG TCCTCATCGC 
TGAGTTCTGG TTCTTC TTC A TTGCTAAGAA CACGCGCACT TGGATTTCCT TCGTTATCTG 
TGATAGAAAT GCCTGCATCC TGAATCCGTT GCAAGAGATC TTCAATCCCA TCAGCGTCCA 
AGGTAAAAGG AATAACCAGA CTTGCATTGA TTTCATCATC TGTTGCTGTC CCT-TTTGCT 
TATGATTACG GATAAATTCT GCTACCTGTA CGTCAAATGT TGTTACTTCT TTTTGTTTTG 
TTGCCATTAT TACTCCATTC TTCTCTTTTG GGAAATTAAA CGTTCCAATT CTTCTAGGGC 
TGTATCTGTA TCTCCTACAT GGCTAGCTTC CTGCACCTTC TTTTTGATTC CCATATTQTC 
CTGATTCAAG AGAGCCTTGT TTCGAAGTCA 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 849 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



PCT/US97/05306 



(ii) MOLECULE TYPE: Genomic DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 



CTAGGGCCAT 


CTTCTCACCT 


CCTCCTTGTC 


AGTACATTCT 


TGCAATAGAA 


AAAATAAGAT 


60 


TGAGTCCCCC 


CAACCTTAAA 


TTTTTTCACC 


ATCTTCTTTT 


TCTTTAGCAA 


TTTGCTCTTT 


120 


GATTTTCTTT 


TCTTCTTCTT 


CTTTGCGGCG 


TTTTTC TTCT 


TCGATACGGC 


GACGCACTGC 


180 


TTCACGTTTT 


CCTTCTGGAT 


CTGGGTGAAT 


TGTAACGTTT 


CCTGATTCGA 


TTTCTTCTAA 


240 


AGCGCGAAGA 


GTTGATTTTT 


CAGACTTGAA 


ACCTTGAGTT 


GCTGGGGCAC 


CTGCTTCCAA 


300 


TTCGTGGGCA 


CGTTTTGCTT 


CCAAGATTAC 


GAGTGAATAT 


TTTGAAGGAA 


CCTTGTC GAG 


360 


CAAGGTATCA 


ATAGAGGGTT 


TTAACATCAT 


TTGCTTGTAC 


CTATTTTCTA 


AATTTTATCG 


420 


GGTAGTTGGA 


GATTTTGGTA 


ACATCTCCTG 


ATAGTGACCA 


ATGACACGAT 


CCACACAGAA 


480 


GTGTTCTGCT 


TCAATCACAC 


ATTTGACACG 


TTCAGCAGCT 


AGGGGTACCT 


GATCGTTGAC 


540 


AATCGCATAA 


TCATACTCAC 


GCATGAGGGC 


AATTTCTTCC 


TTGGCCTTTT 


CGATTCGTTG 


600 


GGCAATCACT 


TCTGCACTAT 


CTGTTC CACG 


ACCTACCAAG 


CGTTCTTGCA 


ATTCATCCAA 


660 


ATCTGGTGGT 


GTCAGGAAGA 


TAAAGACAGC 


ATCTGGAACC 


TTTTTCTTGA 


CCTGAAGAGC 


720 


ACCCTGAACT 


TCAATTTCAA 


GG AAAACATC 


GATTCCCTTG 


TCCAAGGTTT 


CATTGACATA 


780 


GGTCAGAGGA 


GTTCCATAGT 


AGTTGCCGAC 


ATATTCTGCG 


TATTCCAACA 


TCTGTCCTTG 


840 


ACGAATCAG 












849 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 581 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 



GCCAAACAAC 


GCATCCATAA 


TGGTTAAATC 


TTGATAAGGC 


AGAGCTACAT 


TCAAAACAGC 


60 


TTGCTGGTTT 


GTAGGCTTTC 


AATCAGAGCA 


ATCACTTCTT 


CAAC CTTGTC 


AGCATCAAGA 


120 


GCAGCTGTCT 


CAATCTTTGT 


ACTTGTTTTG 


CCTTCCAGTT 


TAGCTTTCAA 


GTCATCGCAT 


180 


TTTGACTTGG 


TACGGCTAGC 


AATC AT AATC 


TCTGTAAATG 


TTTCGCTATC 


TTGACAAATC 


240 


TTTGAAATAG 


C AAC TTGGGC 


AACGCCCCCA 


CAACCAATAA 


CTAGTAAACG 


ACTCATTTTT 


300 


TTCCTTCCTC 


TTCTTC TAAA 


ATGTCCTCAA 


CATACTTGGG 


CAACATAAAG 


GCTCCCACGT 


360 


GTAAGTTTGC 


AGTGTAGTAT 


TCTGTGAAAA 


GCTGGCGTTT 


TTTCCAGCCT 


TC CTTGTC AA 


420 


AATCTTTGAC 


AGGGTGGTAT 


TTTTTC G ATG 


C AAATC C AAA 


CAACCAATAG 


CCAGCTGGGC 


480 


TAGTTGGAAT 


ATGGGCCTGA 


TAAACCCGAC 


TGATTGGAAA 


GGCTTGATTG 


ACCTTGCGGT 


540 
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GCATGCTTCG GCAGGCCGAC TCATCCTCGT CAAAGAAAGG A 



PCT/US97/O5306 

581 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



<xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


74 : 






TTCTGAAAAT 


CCATACTTCT 


ACCAAGGAGA 


TACGCTAGCG 


GTCTCGGAAG 


TTCTCATACT 


60 


TCTATCGCTA 


TATCTGGCCA 


TCGCCTTTCT 


ATCCAAGGAT 


TGACAACAAG 


AGATAAGGCA 


120 


GAGAAAAAAT 


TCTTGCTGGA 


TCAGCTGGTT 


GCCTGCGATG 


GTGGTACAGG 


TGTCATGCAC 


180 


GAAAGCTTCC 


ATGTAGATGA 


TCCGACCCTC 


TACTCTCGTG 


AATGGTTCTC 


CTGGGC TAAC 


240 


ATGATGTTCT 


GTGAGTTGGT 


CTTGGATTAC 


TTGGATATTC 


GCTAAGGGGC 


TCGCTTTAGC 


300 


TCAACCGATT 


CTTATCAGAA 


TCACAAGTTT 


ACATTTAAAA 


CGTTAAAATT 


TAAATTTAGA 


360 


ATGAGGTTTT 


ACTTCATGGA 


AAATGTTGTT 


GTACATATTA 


TCTCACATAG 


C C AC TGGG AT 


420 


CGTGAGTGGT 


ACTTGCCTTT 


TGAAAGCCAT 


CGTATGCAAT 


TGGTGGAATT 


GTTTGACAAT 


480 


CTCTTTGATC 


TCTTTGAAAA 


TGACCCTGAG 


TTCAAGAGTT 


TCCACTTGGA 


TGGACAAACT 


540 


ATTGTCCTTG 


ACGACTAACT 


TACAAATTCG 


CCCTGAAAAT 


CGCGACAAGG 


TCCAACGCTA 


600 


C ATTG AC GAG GGCAAGCTTA AAATTGGTCC 


CTTTTACATC 


TTGCAGGATG 


ATTATTTGAT 


660 


CTCCAGTGAA 


GCCAATGTCC 


GCAATACCTT GATTGGTCAA 


CAAGAAGCTG 


CCAAATGGGG 


720 


TAAATCAACC 


CAGATTGGCT 


ACTTTCCAGA 


TACCTTTGGA 


AATATGGGAC 


AAGCGCCTCA 


780 


AATTCTTCAA 


AAATCAGGCA 


TTCACGTGGC 


GGCCTTTGGT 


CGTGGTGTGA 


AGCCGATTGG 


840 


ATTTGACAAC 


CAAGTCCTTG 


AAGATGAGCA 


GTTTACGTCT 


CAGTTTTCAG 


AAATGTACTG 


900 


GCAGGGTGTG 


GATGGTAGTC 


GTGTTTTAGG 


TATTCTCTTT . 


GCCAACTGGT 


ACAGTAACGG 


960 


GAATGAAATT 


CCAGTTGACA 


AAGATGAGGC 


CTTGACCTTC 


TGGAAACAAA 


AATTGTCAGA 


1020 


TGTGCGTTGC 


CTACGCTTCG 


ACCAACCAAT 


GGTTGATGAT 


GAAACGGCTG 


TGAACACCAG 


1080 


CCTGTCCCAA 


AAAAAATCTG 


AGCGAAGCCA 


TTCCGTGTGG 


CAAATGAACT 


CTTCCCGGAT 


1140 


GTAATCTTTG 


TTCATAGTTC 


TTTTGATGAA 


TATGTTCAAG 
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(2) INFORMATION FOR SEQ ID NO: 75: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 



CTCAAAAAGT 


CTGAAATTGG 


ATCATTTGGA 


ACCATCTGAG 


CAGTAAGTTT 


AACACCTTTT 


60 


TCTGC C AGTT 


TGTGGAAGTC 


TTCCACATCC 


TTGTCTACTA 


CGTTGATAGA 


ACGTGTAATA 


120 


GAACGAGTTT 


CTGGTGTTTG 


AGACATATTC 


CCAACATTAA 


GGGTTTCAAG 


TGGTACACCT 


180 


GCTTC TACCA 


AACCAAGGGA 


AGCGGTCTGG 


TTTACGAGCC 


ACGATAAAGA 


GACGTTGGCT 


240 


ATCGTATTTG 


CCAGCAAGAA 


TATTGGCTGC 


AGCTTTCTCA 


ACTGGCAAAA 


TACTCAATTT 


300 


CAC AC CTGGT 


GGTGTCGCAA 


GTTTCAAACC 


ACTCTTTTCA 


ATATCGTTGT 


TGACAACTTC 


360 


GTCGTCTACA 


ACCATAATGC 


GTGAAACATT 


TAGTTTTCCA 


GCCCAAAGAT 


TGGCTACTTG 


420 


TCCGTGGATC 


AAACGTCCAT 


CAATACGGCA 


TCCTACAATT 


GTCATAAGTT 


TTCCCCCTTT 


480 


ATATGTTTTA 


GTGTAGGTTT 


ACGAGTTAAA 


TGAATCTCTT 


CTTTATATTG 


ACCTTCTGTT 


540 


TCAAAGATAA 


TGATGCGGTT 


GGCACCTTCC 


TTGAGATAG 






579 



(2) INFORMATION FOR SEQ ID NO: 76: 



{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 73 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 i nea r 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 



CGCATANCCT 


AAAAGTTTTC 


TTTGACATCT 


TCTGGAATAT 


CTGTATCATT 


AGCTTCCTGA 


60 


ACATCTCTAC 


CATAAATCCC 


AAAGGCTGGA 


ATCCCTTTTT 


GAGCATGTGA 


AGCTAGTACA 


120 


GCTGCAAGAT 


AGACAGCTCC 


TGGGCGTTCT 


GTCCCATTAA 


ATCCCCAAAT 


AGCATGAGGA 


180 


ATATCTGGAG 


ACATATCCAT 


AGTTTCACTA 


CCATAACACC 


AGCATGGTGT 


AACTGTAATT 


240 


GTTGCGCAAA 


CATTTGATTT 


TTTAAACAAC 


TCATGGGAAG 


CTGCAGCCTC 


TGGAACACGA 


300 


C C AATGGT AG 


ATGGAGAAAT 


CACACATTCC 


ACAGGTTCCC 


C ATCTGG AT A 


TTTCAATGTG 


360 


CTTGAAATCA 


AATCTGCCAC 


ACTTTTAGCC 


ATGTTCATTG 


TTTGTACTTC 


AAGTGATTCG 


420 


CGTACACCTT 


GACGACGACC 


ATCAATAGTC 


GGACGAATCC 


CAAT AC GTGG 


ATGTTGAATC 


480 


ATACTATTTT 


CCTCTTATTT 


TTCTGTTTCT 


TTCATACTAC 


CATCGC C ATT 


ATATTTACGG 


540 


TAGCCTGGGT 


GACGACCTGT 


AACCTTATAA 


TTTTCTCGCA 


TTGAGAATAG 


ACGTTCTAAA 


600 


GTCGGACGAG 


CAATTTCTTG 


CTCCTCAATG 


CCCTTTGTAG 


AAAGTAACAT 


TCTTCCGTGG 


660 


AAGGTTGTCT 


TTGCGACTAA 


TTCTAAAGTT 


TCCATACGGT 


AGTATGCTGT 


AATTACATCG 


720 


CTTCCGACAG 


TCAGAG 
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(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 590 base pairs 
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(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic .DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

CTCATTGTAT TTAGGAGAAA TGATGGTATC TTCCAAATCA AAATCAACTT CACTCCATAG 
TCTCAACGGA TTGATTTTCC CATCTTGATA GGTCACATCC TTGTCAAGGA TAAACTGAGT 
CAACGCCTCA TGCTGACCTT GACACCTGAT GTCATCTACC AAGAGCCAGA CATCCTCTAC 
CAACATGAGG ATTTTTCTCT TGTGAAGATA AGGCAAATCA GGTTCTGCTG ACCAATAAGC 
CCCTTCAATA TAATGCATTC CCTCCCTTTC TTTATGGTGA CAAAACAGGG AGTGAGGATA 
GTATTCATAT TCCCAGGATC CCGTGATTCT TTCCGGAGCT TTCCC ATC T A CAATGCAGGT 
CGAATGACTC CAAGCACTCT TTAAGAGATA ACGTTCATAT ATCTCCCGAT AAGAATAACG 
CCCAGCATCT ATGAAAATAG GTTGGCCTTG ATACTGTAAG CAAAAACTAT TCTCGTCACT 
ATGG CTATGG GCACTTCCTA GCGGACCATT TTTGAAAAAT AGATAACGAT GTTCATCCTT 
AATGCAGACA TGTCCAGAGT CTTCAAAGAT CATGGACTTA GGCTGCCAAG 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1199 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 78: 

CTAGTGTAGC TTCCTTAATC TTATCTGATA AGATAGCTGT CATATCAGAC TCAATCATTT 
CCTGGAGCAA TCACATTGAC TCGTATATTC CGACTAGCGA CCTCGCGTGC CACAGACTTG 
GTAAAGCCAA TCAAGCCAGC CTTAGAAGCA GCATAGTTAG CTTGACCAAT ATTCCCCATC 
AAACCAACAA CACTAGACAT ATTAATGATA GCACCTTCTC TGGCTTTCAA TCATCGGTTT 
CCAAAGACTG ATTGTGTCAT ATTAAAGGCA CCAGTCAGAT TCACCTTGAG CACTTTTTCA 
AAATCTGCTT CTGTCATCTT GAGCATAAGA GTATCTTGGG TAATCCCTGC ATTGTTGACC 
AAAACATCTA CTGAACCCAG TTCTGCAATA GCTTGATCAA TCATACGCTT AGCGTCTGCA 
AAATCTGATA CATCTCCTGA AATGGGAACC ACCTTGATAC CATAGTTTGA AAACTCAGCG 
AGCAATTCTT CTGAGATTGC CCCACGACTG TTTAAGACAA TGTTGGCTCC TGCTTGAGCA 
AACTTGTGGG CGATGGCAAG ACCAATTCCA CGACTCGAAC CTGTAATAAA GATATTTTTA 
TGTTCTAGTT TCATTTTTTT CCTTTCAAAA CTTCTACTTA TTTTAGTCTA TTTTTCTAAA 
AGTGCTACTA AACTCGCTTG ATCTTCCACA TGAGCTAAGT GAGCAGTTTG ATCAATTTTT 
TTAACAAAAC CTGACAAGAC TTTCCCCGGT CCAATCTCGA ATAAAGTTGC TTATGCCTGC 
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TTCTTGCATG 


ACCCCAATAC 


TTTCATAGAA 


ACGAACGGGT 


TCCTTGACCT 


GACGCGTCAA 


840 


GAGCTGAGCA 


ATGTCCTCTT 


TTTGCATCAC 


AGCAGCTTCT 


GTATTGCCGA 


CTAGGGGACA 


900 


AGTAAAATCT 


GAAAAACTTA 


CCTGAGCTAG 


AGTTTCAGCT 


AGTTTCTG^O 

«VJ ill X \J\J\- 


TAGCAGGCTC 


960 


AAGGAGAGCG 


GTGTGAAAGG 


GACCTGACAC 


CTTAAGAGGA 


ATCAAGCGTT 


TGGCACCTGC 


1020 


TTCTTGCAAA 


AGTTCAACCG 


CTCGATCAAC 


TGCAACCAC T 


X Vw X V>X\\JV_ rv\ 


X \J — OA XIX vj 


1UOU 


TGCAGGTGTG 


TTATAGTTGG 


CTGGAGTAAC 


CACTCCAAGT 


TCCAGAAGCT 


TTTTGACAGG 


1140 


CTTCTTCAAT 


GACCTCTACT 


GGCGTATTGA 


GAACTGCTAC 


CATCTTGCCA 


AGTTCAGCA 


1199 


(2) INFORMATION FOR SEQ 


ID NO: 79: 








(i) SEQUENCE CHARACTERISTICS: 








(A) 


LENGTH: 1414 base pairs 








<B) 


TYPE: nucleic acid 










(C) 


STRANDEDNESS : single 










(D) 


TOPOLOGY: 


linear 










(ii) MOLECULE TYPE: Genomic 


DNA 








(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 79: 






CTACTATTTT 


CTTTGCCATA 


GCCTTCTCCT 


TT AC AC AC T A 


CGCATATCGT 


GGTAAGAAAC 


60 


ACTGCGTCCC 


ATCTCACCTG 


CATTCTCTTT 


TTGAACAAAG 


GTATTAGCGT 


TTATATAGGC 


120 


AATAGCAGAA 


GCCTTCAACA 


CATCAAAATC 


AAGCCCTGCT 


GCATTAAAGA 


TGGTTTCTGT 


180 


ATCCCTGTTT 


TCAACAGTGA 


CCAACAACCC 


GATCCTGGGC 


ATCGATTC C A 


TCTGTTACCG 


240 


CATTGATAGT 


GTAGGACACC 


AAACGAACAG 


ATTGGTTAAA 


GAACTTATCG 


AT AGC GTTAA 


300 


AGATTGCTTC 


AACGGAACCC 


TGTCCC TGTC 


GCATTAAATT 


CGACTTTCTC 


ACCATCCATA 


3 60 


TTGGCTAGGC 


TAACGAGCGC 


TTCAATGTCA 


TTATCTGCAT 


GAGTTTGAAG 


TTGTAAATCA 


420 


TCAAAGTGGA 


AGCCTTCTGG 


ATTTTCAACC 


ATGGTTCCAG 


CTACCAAAGC 


TCGAGTATCT 


480 


GCATCTGTGA 


TTTCTTACTT 


CTTATCGGCC 


AGTGCCTTGA 


ACTTAGCAAA 


GAATGGTTTG 


540 


ATATCCTCTT 


CTGTAAAATC 


TAAGGCCAAT 


TCTCTCAGTT 


TCTCAACAAA 


AGCATGGCGA 


600 


CCAGATAATT 


TTCCAAGCGG 


AATCTTAACA 


CCAACCAATT 


CAGGTGTGAT 


GATCTCATAA 


660 


GTGAGAGGAT 


TTTTAAGGAC 


TCCATCTTGG 


TGAATACCAG 


ATTCGTGGGA 


GAAGGTATTG 


720 


CCACCAACGA 


CGGCTTTGTT 


TTTAGGAACT 


GGAATACCAG 


AGAAGCGAGA 


AACCATTTCT 


780 


GACGTATTGA 


TGGTCTCATT 


TAGGACAATA 


CTGGTTTCTA 


CTTGGTAGTA 


ATCTTGGCGA 


840 


ATATTGAGAG 


CCACTGCAAT 


TTCTTCCAAA 


GCAGCATTTT 


CAGCTCGCTC 


CCTAATACCA 


900 


TGGATAGTCT 


CTTCAAAAAG 


TCCTGCACCA 


TTCTTGACAG 


CAGCAAGGCT 


ATTTGCCACT 


960 


GCCATCCGAA 


GTTCATCATG 


ACAGGGAGGC 


GAATAGATGA 


TCTGACGATC 


CGTCTGGACA 


1020 


. TTCTCAATCA 


GGTATTGGAA 


GATGGCAC C A 


CATTCCTCTG 


GTGTGGTAAA 


TCCTATATTT 


1080 


TCTGAAAATT 


TCTTCAGTAA 


AGAATATTTA 


GCTAATTGAA 


AGTTCATGAA 


AATTATTAAA 


1140 


ATATTTCATT 


TTTTAGAGGT 


TAAGTTCCAA 


CTTTTTTCTA 


TCAATTCCAG 


TACTTNTTCA 


1200 


TCTGATAAAG 


TATCATCAAG 


GGACACACTA 


ATCCAGTAGC 


GCTTGCTCAT 


ATGGAAGGCT 


1260 


GGATAAATCC 


CC TTTTGTG A 


AAGCAAATTA 


GCTACTTGGT 


CATGCTTGAG 


GTTGACTGCT 


1320 


TCCACTTGTC 


CTTCTCTGCC 


CTTTTCCAGC 


TTATTCCAAG 


AGATTTTCAN 


CAAGACGGCA 


1380 


TACCACTTTT 


GATTGCCTTC 


ATGGCGCAAT 


ACAG 






1414 
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(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 



(2) INFORMATION FOR SEQ ID NO: 81 



60 



CTCCCCATTT TGGAAAATTT CTGTCAAGAA ACGGCGAACC AGCTTTTTAT TTTCTGCTTT 6 0 

CTTGTCCAAA TCCTTGATTT CAAAATCTCC AAAAATTTGA TCTAGTTGGT CATTTTCAGG 120 

TGTTCGATAG TAGTCAATGA CATC CC AATG CTCAACAATA CGACCATTCT CATCCGCACG 180 

GAAAGTATCC GTCGTCACCC ATTGAGCTTC TCCACCATTC AGATATTGAT GGAACATGAA 2 40 

CAAAGACCAG ATTGCCATCC TCAATGGTGC GGACAATCTT AATCTGACGC TCTGGATGAC 300 

GCTCAAAGAA ATCTGCAAAG AAGGCTGCAA ATCCTTCTTT CCCGTCAGGA ACACCTGTCG 3 60 

AATGTTGGAT ATAGGTATCC CCTACAGACT GGGCTTGAGC CTCAGCAACT CGTCCGTCTT 420 

GAATGGCATG GATGTATAGG TTGTGAGCAT TTTTCACTTG TTGTGACATA TTCTAAACCT 4 80 

CATTTCCCTT CTCTTTCAGA TTCGCCAAAA TTCTTTCTTG AAAACCTTCA AATTGGTGAA 540 

TTTCTTCCTC TGAAAATCCT TTGTAAAAGA TAGTATCCAA TTTCTGACTG ACACGATGCC 6 00 

CCACTTCTTT CTGGGACTTG CCTAACTCCG TTAAAACTAA ATACTTCTTA CGCTTGTCTT 6 60 

TTCCACACGG ACTAACAATT ACAAGCTTTT GTTCCTCTAG CTTTTTTATC ATAGTCGTCA 720 
GCGTATTATT CGCAAGCCCA GTCGCAAGCG 



750 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 



GTATCGTTCA ATTCCAAAGA TACGGAGGAA GCTGCGTTTC GCGAGAAAAC TCTTGTTAGA 
CCAGATAGGG ATTAGCCAAA ATCCTTCTAT GNGCTTATAT CCAAATGGAG ATGCAAACTT 
CTGCCCAACT CTTGACCAAT AAGATCTTTC TTAAAAATCC ACTCAAGGCT CTGGTAGAGG 
AAAAATATGG GATTGAGTAT GAAGAATTTA CCAATCCTTG GCACGCTGCC ATTTCTAGCT 
TCGTTGCCTT TTTCCTTAGA AGTTTGCCTC CAATGCTGTC AGTGACCATA TTCCCAAGTG 



60 
120 
180 
240 
300 
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AATATCGCAT CCCTGCTACT 


GTCCTTATTG 


TCGGTGTGGC 


CCTTCTTCTC 


ACTGGTTACA 


360 


CTAGTGCCAG 


ACTTGGAAAG 


GATCCGACTA 


GAACAGCTAT 


GATTCGGAAC 


CTTGCTATTG 


420 


GTCTCTTGAC 


CATGGGAGTT 


ACCTTCCTGC 


TCGAACAACT 


TTTCAGCATT 


TAGAATACAA 


480 


GAAATACCTC 


GATTTTGAAG 


TCGAGGTATC 


TTTTTTACAT 


TTGCACAATC 


TTGCGATAAC 


540 


TTCTTGAAGT 


AATCATGAAA 


ATCAGCACAT 


AGGCGATGAG 


GAAGATAGCG 


CAGATAGACA 


600 


AGGTCACAAT 


CAACATCATA 


GTCGTATCCA 


GTACACCAAT 


CACTTTTAAA 


ATCAGGCTAA 


660 


GCATATGGTA 


GGCAAAGGCG 


AGATGTATGA 


AGGCAAAGAG 


CAAAGGAAGG 


AAGAAAACAG 


720 


TTAAAACCTG 


TTTGTTGATG 


GTTTGCTTGA 


TTTGCTTTTG 


GTCCAAACCG 


ACTTTCTGCA 


780 


AGATAATAAA 


GCGTTCACGG 


TCTTCGTAGC 


CTTCAGAAAT 


TTGTTTGTAG 


TAGATGACCA 


840 


GAACAGTTCC 


GACCATAAAG 


ATAATGGATA 


GGAAAAT AC C 


GATAAAGAAG 


ACACCGCCAA 


900 


AGAGGACACT 


CATTTGAGCA 


CTAGCATCTG 


CTAGATTGCT 


ACCATAAACA 


TAG 


953^ 



(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 665 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 



CTGATTGAAC 


TTCTATTTAC 


TAATATTCAA 


AAATCCTCCG 


TTTCAAAGAG 


CAGGGAACTC 


60 


TTTGTGACAG 


AGGATTTTTT 


CTATAGGGCT 


TTAGCAGCTG 


CAATTGCGGC 


TTCGAAGTTT 


120 


GGCTCAGAAT 


TGATATTATC 


CACGTATTCA 


ACGTAGCGAA 


TCGTATTGTC 


AGTATCGAGG 


180 


ACAAAGACTG 


CGCGTGCTAA 


TAGGTGCCAT 


TCGTTGATCA 


AGAGGGCATA 


ATCGCGCCCG 


240 


AAAGAATGGT 


CAAAGTAGTC 


TGAAAGCATA 


ATGGCATTGT 


CAAGGCCTTC 


AGCACCGCAC 


300 


CAACGTTTTT 


GAGCAAAAGG 


TAGGTCCATT 


GAAACAGTCA 


ATACGACCGT 


GTTGTCCAGT 


360 


CCAGCCAATT 


CTTCATTAAA 


ACGACGTGTT 


TGAGTTGAGC 


AGATGCCTGT 


ATCGATAGAA 


420 


GGAACGACAC 


TCAAGACTTT 


TTTCTTGCCA 


TCAAAATCAG 


CCAGAGATTT 


TTTAGAAAGA 


480 


TCTGTTGTAG 


TAAGAGAAAA 


ATCAAGCGCC 


TTGTCGCCGA 


CTTGTAGTTG 


TTTACCTGTA 


540 


AAGCTCACAG 


GATTTCCGAG 


AAAAGTTACC 


ATAGGATACT 


CCAATCTTTT 


TTCTTCCATT 


600 


GTATCTGAAA 


CAGTCAGAAT 


TTTCCAATGA 


TTTGACCGGA 


AATGTGGGCA 


TAGAAAAAAC 


660 


GCCAG 












665 



(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 9 base pairs 

(B) TYPE: nucleic acid 

. (C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


83 : 






CTAAGTATAA 


AAAACTTGCC 


GAAAGTATCT 


CTGAAGTTAA 


TTTGAATGTC 


TTTTTTCAAA 


60 


ATGCTTTAGA 


TTTTGTTGAA 


ATCTTTAATC 


AAGAGCCTTG 


TCTGATAAAT 


TTTTTTGAGT 


120 


TTATGAATTC 


TATGTTACAA 


GGCGATATTC 


CAAAGTTAAC 


AATAAACCAA 


GAATAATAAG 


180 


AGGGGAACAG 


TATGAAAGAT 


ACGATTTCTA 


ATAAGGATTT 


GATTTCCATG 


GGCTATAGAC 


240 


CATCAACAGC 


AAATGCTATT 


ATCCATCAGG 


TGAGAGAATT 


ACTTGTATCA 


C G AGGCT AT A 


300 


CATTTTATAA 


TCGCAAACGT 


TTGATGGTTG 


TTCCAAAAAG 


TGTTGTGAAA 


GAGTTGTTGG 


360 


GAATGGAATT 


GTGAAATGGC 


TTATATCGAG 


TATAAACAGC 


GTGGGAAGAA 


AAGGCTTTGG 


420 


TCGTTTTCTA 


TACGTGAGAG 


GAGCAAGAGC 


CTACTCCATA 


AAAGCGGATT 


TAAAACAAAA 


480 


CGAGAAGCTA 


AAATAGAAGC 


GGAGAAAGTT 


CTTCATAAGT 


TAAATACTGG 


GAGTGTC TT A 


540 


AGCTCTAGTA 


TGAC TTTATC 


GGAGCTTTAT 


AATGAATGGC 


TGGATTTAAA 


AATTTTACCT 


600 


AGCAATAGAA 


GTGTAGTTAC 


TAAAAAAAAA 


TATCTTATGA 


GAAAGAAGGT 


CATCGAAAGG 


660 


TTATTTGGAA 


ATAAGCCTGT 


ATCACAAATT 


AAGCCTAGCG 


AGTATCAAAA 


AATTATGAAT 


720 


GAGTATGGAG 


AGACTGTATC 


GAGAAATTTC 


TTAGGAAGAT 


TGAATTCTAG 


TATCCAGGCA 


780 


AGTATACAGA 


TGGCTATTGC 


TGATAAGGTG 


ATAATAGAGG 


ATTTTACTGC 


TTATGTTGAG 


840 


TTGTTCTCTT 


CTAAGAGTGG 


ACACAAAGGT 


TGAGGAAAAG 


TATCTACATA 


CTGAGTCAGA 


900 


TTATCAAAAA 


GTTTTAGTAT 


ATTTGAAAAA 


TAAGTTTGAT 


TATCAGAAAT 


CTATTGTACC 


960 


GTATGTAATT 


TATTTC CTTT 


TTAAAACTGG 


CATGCGTTTC 


TCCGAATTGA 


TTGCTCTAAC 


1020 


TTGGGATGAA 


GTTGACGAG 










1039 



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


84 : 






CTGAGAAGTC 


AGGAAGGCGA TGTGGAGAGA 


TTGAGGAATT 


GGATACTTAA 


CTCCATAGTA 


60 


ATCTTCGTAA 


AACTCGATAG AGCGAACAGC 


GATATCCAGT 


GAGAAATCAA 


G A TTTGAAAG 


120 


TGGATGTGCT 


TTGGTTGAGT AGACACCCTA 


CCAGGGTACC 


ATTTTTAGTT 


TTAGCGGTCA 


180 


CCCCTTGCAA 


ATCACCAGCA ACAAAGGCCA 


ACAAGTAAGA 


AGACATGCGA 


GGTGTTGTCT 


240 


CAAACTTCCA 


GATACCTGTT TCCTTACGGT 


TTTCAACATC 


GATTTCTGGC 


ATGTTTGACA 


300 


AGGCCAATTC 


ACCTTCTGCT TGGTCAAAGC 


GAAGAGAGAG 


GTCAAAAGTT 


GCTTTGGCTT 


360 


CAGGCTCATC 


CACACATGGG AAAGCTTCGC 


GCGCAAAATG 


GCTCTCGAAC 


TGAGTAGACA 


420 


AGACCTCCTT 


CTTGACTCCA TCAACTGTAT 


AATAAGAAGG 


GTAAATCCCT 


GTCATGTTGT 


480 
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CTGTAATTTT ACCAGAAAAG GCAAGAACCA ATTCAACTTG ACCAGCCTCA GCCAATTCGA 540 

TATGAAGGGC TTCATTGTCA TGGTCAACTG TAAATGGACG AGCTTGACCT GCAACTTCTA 600 

CAGAGGTGAT TTCCAAGTCT TTTTGGTGGA GGGAGATGCG GTCACTCTGT GCTTGACCAG 660 

TGATGGTCAC TTTCCCAGAA AAAGTCTTGG TCTCACGACT CAAATCTAAA AATAAATCAT 720 

AATGTTCAGG AACAAATTGC TTAATAAAAT GTTCAACTGC TTGCATAGTT TTCTCCTATT 780 
CTAAGTTTAA GAG 



(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 42 base pairs 

(B) TYPE: nucleic acid 

< C } STRANDEDNESS : s i ng 1 e 
(D) TOPOLOGY: linear 



793 



(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



CGGATGATTT 


GCTTTTCATC 


TATGATAAAG 


TCTTTGTTGA 


GCGTCGGAAT 


CTCTACTGAC 


60 


TGGAAATTTC 


CCGTAGATAA 


TCCAAATGCC 


CTTTAAAGAA 


AACCTCATCT 


GTCAACACCG 


120 


AAATCATCAC 


TGCTCCGTTT 


TCTTCATAAG 


TCTGGGCCTG 


TTGCACAATA 


TCCACATCGA 


180 


GATTGATATC 


TCCCCAAAAC 


TAGGGCTAGC 


TTTCTTGACT 


TCAGCGATTA 


CCTGCAAGCG 


240 


GTCCTGATGA 


TTCTTCAAAA 


ATTCTGCCAA 


GCGATAGGTC 


TGGCGCAGAG 


GCTGGATTTG 


300 


CTCCAGCTTC 


ATCTGCTCCA 


CCTCACGCGC 


CTTCTGCTCT 


AAGATTCGTG 


CTAAAAATTC 


360 


CTGACTCATT 


TTTGGTACTC 


CTGTAACAGT 


CTGAGTTTTT 


CAAGGGCCTT 


GCCTCTAGCA 


420 


ATCACTTGAC 


GGGCCAAGGC 


AACTCCTTCC 


TTGATGCTAT 


C AATCTT AC C 


ATTAGCATAG 


480 


AAACCAAGAC 


CAGCATTCAA 


GACTGTCGTT 


TCCAAGAATG 


GACTTGCTTC 


GTTTTTCAGA 


540 


ACGCTAAGCA 


AAATTTCTGC 


ATTTTCCTGA 


GCATTCCCAC 


CACGAATATC 


TTCCATAGCA 


600 


TAGCCTTCCA 


TTCCCAAATC 


CTCTGGAGTA 


AAGCTTGACA 


AG 




642 



(2) INFORMATION FOR SEQ ID NO: 86: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 



CTCTTGCAGG TTATTAAGGA GAAAACGGAG GTAATAGAAG TATGATTTAT ACAGTCACAC 

1 10 
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TCAATCCATC CATTGACTAT ATCGTTCGTT TGGACCAAGT CAAAGTTGGT AGTGTAAATC 
GTATGGACAG TGATGATAAG TTTGCTGGTG GGAAAGGAAT CAATGTCAGC CGTGTCTTGA 
AACGTTTGAA TATATCAAAT ACAGCGACGG GATTTATCGG TGGCTTTACT GGTAAATTTA 
NCNCAGATAC TTTAGCAGAG GAAGAAATCG AGACNCGTTT TGTCCAGGTG GCAGAAGATA 
CTCGTATCAA TGTTAAAATC NAAGCAGACC AAGAAACAGA AATCAACGGA ACGGGTCCAA 
CTGTTGAACC GGTTAAGCTA GAAGAATTGA AAGCTATTTT ATCTAGTCTG ACAGCAGAAG 
ATACAGTTGT CTTTGCAGGT TCAAGTGCTA AAAATCTAGG CAATGTTATC TATAAGGGAT 
TTAATCTCCT TGACGCGCCA GACTGGTGCG CAAGTGGTCT GTGACTTTGA AGGACAGACC 
TTAATTGATA GTTTGGATTA CCACCCTCTT CTTGTAAAAC CAAATAATCA TGAACTTGGA 
GCGATTTTTG GGGTTAAACT CGAAAGTTTA GATGAAATTG AGAAATACGC TCGTGAGTTA 
CTGGCTAAGG GTGCTCAAAA TGTTATTATC TCTATGGCTG GTGATGGTGC CCTTCTTGTC 72 0 

ACATCTGAGG GAG L /2 ° 

733 

(2) INFORMATION FOR SEQ ID NO: 87; 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 536 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 87: 

CTAAGGGAAG TAGGAAAAGT ATGTATCCAG ATGATAGTTT GACATTGCAA CACGGACTTG 
TACCAGATCA ACATGATGCA GG TTTACTTT GATCAAGGGA TTTACAATAA GAAGGCGGTC 
TTTGAGGTGT ATTTCCGCCA ACAGCCTTTT AAGAACGGCT ATGCGGTTTT TGCAGGTTTG 
GAAAGAATTG TGAACTATCT TGAAGACTTG CGTTTTTCAG ATAGTGATAT AGCCTATTTG 
GAGTCGCrTG GTTATCATGG GGCGTTCTTG GATTACCTTC GCAATTTCAA GTTGGAGTTG 
ACCGTTCGTT CTGCCCAAGA AGGGGATT^ GTTTTTGCTA ATGAACCGAT TGTCCAGGTG 
GAAGGACCTC TAGCCCAATG TCAGTTGGTC GAAACGGCTC TTTTGAACAT CGTCAACTAC 
CAGACCTTGG TGGCGACGAA GGCAGCCCCC TATCCGTTTG GTTATCGAAA ATGAACCCTT 
GATGGAGTTT GGGACACGTC GGGCTCAAGA AATGGATGCG GCCATCTGGG AACACG 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 568 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
3 60 
420 
480 
536 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 



CTGCCCCTGT 


AAGGCTGGAC 


GATTGCCTTT 


CTTAGTATCC 


GCAAAGAGGT 


AAACTGAGAA 


60 


TAGAGAGGAT 


TTCTCCTTCA 


ATATCTTTGA 


CAGACAGGTT 


CATCTTGCCT 


TCTACGTCTG 


120 


AAAAAATCCG 


CATATTGACC 


AGTTTTCTCA 


CAGCATAGTC 


CAAATCTTCC 


TCTTGGTCCT 


180 


CTGGTCCAAC 


ACCAACCAGC 


AATAAAAGTC 


CCTGATTGAT 


TTTTCCCTGA 


ATCTGGC CTT 


240 


CTATACTCAC 


TTGGGCTTTT 


TTAACCCGTT 


GGATAATGAT 


TTTCATAATA 


GCCTTTCTAG . 


300 


TAAGAGCTAG 


GACAACTAGC 


CGTTGGTCCG 


TTTGACAGAG 


TAAACTTCTG 


GCACACTCTT 


360 


AATTTTATCG 


ACAACCGTGG 


TCAGTGTAGA 


GAGGTTGGCA 


ATACCGAAGG 


ACACATGGAT 


420 


ATTAGCAAAC 


TTCATATCCT 


TGGTTGGTTG 


GGCATTGACC 


GTTGAAATAT 


TCTTGGTTGT 


480 


ATTTGAAAGA 


ACTTGCAGTA 


CATCGTTCAA 


CAGTCCTGTA 


CGGTTGAGAC 


CGTAGATATC 


540 


GATATGGGCC 


ATATACTCCT 


TATTTGAG 








568 



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 



CTTTATATTG 


TACCAAGTAT 


CATTTCTAGT 


GCTGCTATGG 


GCTTGATTTT 


TCTTCAAATC 


60 


TATAATCCAA 


ACTATGGTGT 


TGTTAACCAA 


ATTATTCATC 


TATTTAATCC 


ATCGTTTAAA 


120 


GATTCAGTAC 


TGTTGACTCC 


CAGGATTAAA 


AATAGTAGCT 


ATGACTGGCG 


CTTATATCTT 


180 


CTTTGCAGGA 


GCATCAACCA 


TTATGATTTC 


TTGGGCAAAT 


TTTTGCTATT 


C C AG AAG AAG 


240 


TTCAAGAAGC 


TGCTATTTTA 


GACAATATTA 


CTGGTTGGAG 


GAAAGAGTGG 


TATATTACGA 


300 


TTCCGATGAT 


TAAGGGGACA 


ATTAAAACTG 


TTTCAATTAT 


GGCAGCAACT 


TCAGGATTTT 


360 


TGCTCTATAA 


CGAAGTATTC 


TTTTTGACAA 


ATGGTGCTGC 


AGGAACAAAA 


AGTATCAGTT 


420 


TTGTTATTCG 


AGAATTAGCA 


GTGGCTAGCT 


CACGAACTCA 


GTATGCTCGT 


GCAAATACAA 


480 


TTGGAGTTAT 


ACAAATCTTA 


GGTGG AATGT 


TGATTATCGT 


TTGTATTAAT 


ATTTTATTCA 


540 


GAGAAAGAAA 


AAG AC TGAAA 


GGTGGGAAAT 


GATTATGAAT 


ACACATATAA 


ATGGTATTAG 


600 


TAAAAAAGGC 


AAAGTTCTTA 


TATATGGTTA 


TATGCTCCTT 


ACCATTTTAA 


TTTCTATTTT 


660 


CCCTATTGCG 


TGGATTTTTT 


TATCATCATT 


AAAAGCAGAT 


CCTATGAAAA 


ATCCAGGTAT 


720 


TAGTTTACCG 


ACTGACTTTA 


CTCTTGAAGG 


TTATATAAAT 


GTTTTTACAA 


AACTTCATGT 


780 


TTTTACTTAC 


TTTTGGAATA 










800 



(2) INFORMATION FOR SEQ ID NO: 90: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 



CTCAGTATTA 


TCCCTACCAA 


AATGAACGTC 


AGTTAGTGAC 


TCAGGTGGTT 


TGGGAACAAT 


60 


GGGCTTTGGA 


ATTCCAGCAG 


CAATCGGTGC 


TAAAATTGCT 


AACCCAGATA 


AGGAAGTAGT 


120 


CTTGTTTGTT 


GGGGATGGTG 


GTTTCCAAAT 


GACCAACCAG 


GAGTTGGCTA 


TTTTGAATAT 


180 


TTACAAGGTG 


CCAATCAAGG 


TGGTTATGCT 


GAACAATCAT 


TCACTTGGAA 


TGGTTCGCCA 


240 


GTGGCAGGAA 


TCCTTCTATG 


AAGGCAGAAC 


ATCAGAGTCG 


GTCTTTGATA 


CCCTTCCTGA 


300 


TTTCCAATTG 


ATGGCGCAGG 


CTTATGGTAT 


TAAAAACTAT 


AAGTTTGACA 


ATCCTGAGAC 


360 


CTTGGCTCAA 


GACC TTGAAG 


CTACTACTGA 


GGATGTTCCT 


ATGCTAATTG 


AGGTAGATAT 


420 


TTCTCGTAAG 


GAACAGGTGT 


TACCAATGGT 


ACCGGCTGGT 


AAGAGTAATC 


ATGAGATGTT 


480 


GGGGGTGAAG 


TTCCATGCGT 


AGAATGTTAA 


CAGCAAAACT 


ACAAAATCGT 


TCAGGAGTAC 


540 


TCAATCGCTT 


TACAGGTGTC 


CTATCTCGTC 


GTCAGGTTAA 


TATTGAAAGC 


ATCTCTGTTG 


600 


GAGCAACAGA 


AGATCCGAAT 


GTATCGCGTA 


TC AC T ATT AT 


CATTGATGTT 


GCTTCTCATG 


660 


ATGAAGTGGA 


GC AAATCATC 


AAACAG 








686 



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

TTTTCCCAGC TATTTTACTG AGTATAAAAG CCTTATTTAA CTTACTCTAT GTACTCGGTT 60 

TTCTAGGAGG AATGTTGGGA GTTGGGATTG CTTTGGGGTA CGGAGTGGCC TTATTTGACA 120 

AGGTTCGGGT GCCTCAAACA GAAGAATTGG TGAATCAGGT CAAGGACATC TCTTCTATTT 180 

CACAGATTAC CTATTCGGAC GGGACGGTGA TTGCTTCCAT AGAGAGTGAT TTGTTGCGCA 240 

CTTCTATCTC ATCTGAGCAA ATTTCGGAAA ATCTGAAGAA GGCTATCATT GCGACAGAAG 300 

ATGAACACTT TAAAGAACAT AAGGGTGTAC TACCCAAGGC GGTGATTCGT GCGACCTTGG 360 

GGAAATTTGT AGGTTTGGGT TCCTCTAGTG GGGGTTCAAC CTTGACCCAG CAACTAATTA 420 

AACAGCAGGT GGTTGGGGAT GCGCCGACCT TGGCTCGTAA GGCGGCAGAA ATTGTGGATG 480 

CTCTTGCCTT GGAACGCGCC ATGAATAAAG ATGAGATTTT AACGACCTAT CTCAATGTGG 540 

CTCCCTTTGG ATCGAAATAA TAAGGGACAG AATATTGCAG GGGCTCGGCA AGCAGCTGAG 600 
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GGAATTTTCG GTGTAGATGC CAGTCAGTTG 
CTTCCACAGA GTCCCATTAC TTACTCTCCT 
TGAAGACCTA GAAATTGGCT TAAGACGGGC 
GGTGCATTAA GCAAAGACAA TT 



PCT/US97/0S306 
ACTGTTCCTC AAGCAGCATT TTTAGCAGGA 6 60 

TATGAAAATA CTGGGGAATT GAAGAATTGA 720 
TAAGGCATTC TTTACAGTAT GTATC CT AC A 780 

802 



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 



CTACTATTTT 


AGCATAAAAA 


TGCCCAAAGG 


GGGNGCCGTG 


TGTTTACTGA 


TTTTCAGGNT 


60 


AATGG AC CAG 


GNAAATCAGC 


ATGAAAATAA 


AAAGAGAAAC 


AGATTATTTT 


AGCCATTTGT 


120 


CAGATTTATG 


CTATGCTTAA 


GGTAGAAAAT 


GAAAGGGATA 


ACAAATGTAT 


TTAGGAGATT 


180 


TGATGGAGAA 


AGCCGAGTGT 


GGTCAATTTT 


CAATCCTTTC 


CTTTCTATTA 


CAAGAGTCTC 


240 


AGACGACCGT 


CAAGGCTGTA 


ATGGAAGAAA 


CAGGATTTTC 


AAAAGCAACC 


CTAACCAAAT 


300 


ATGTCACCCT 


GCTCAATGAC 


AAGGCTTTGG 


ATAGTGGCTT 


AGAACTGACT 


ATTCACTCAG 


360 


AAGATGAAAA 


TCTGCGTCTG 


TCTATAGGTG 


CAGCTACTAA 


GGGGAGAGAT 


ATTCCGGAGC 


420 


TTGTTTTTTG 


GATAGTGCTG 


TTAAATACCA 


GATTTTGGTT 


TATCTTCTCT 


ACCACCAACA 


480 


GTTTTTAGCC 


CATCAGCTGG 


CTCAAGAATT 


GGTGATTAGC 


GAGGCTACGC 


TTGGTCGTCA 


540 


CTTAG 












545 



(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

GAGGTCAAGA TTGATTCCCA AGGTTNGAGG AATCTGCGGA ACCAGATTTA GAAGAAATCG 60 

AGCGGTTACC ATGCTTGGCC ATGTGAATAC CGCCACCAGC CAAGACAAAG GCTGCAGTTG 120 

TGGAAATATT AAAACTGAAA GACTTGTCCC CACCTGTACC ACAGTTGTCC ATAGCATCAT 180 

GAATCTCAGT TGGAATATGC TGGGCATGTC CTCTCATGAC TTGGGCAATG GCTGTGCGTT 240 
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CTTCAGGTGT TTCCCCCTTC ATC TTAAG AG CTAAGAGGAG AGAAGCAATC TGCGCTTCAG 3 00 

TTACACGCCC AGTTACGATA CGCTCAATGA CATCCGTCAT TTCCACACCT GATAAATTTT 360 
CAAATTTTGC TAGTTTTTCA ATAATCTCTT TCATCCTAGT TTCCTCACTT TACAACCTCC 42 0 

TCGATAAAAT TCCGAATAGA AGACAAGCCG TCTGGCGTTC C AATG CTCTC TGGATGGTAC 
TGGAAGCCAT AAATCGGTAG GTTTTTATGT TGAATCCCCA TGATGGCTTG GTCATCAGTC 
GAACGAGATC AAGCTTATCG ATACCGTNGA CCTCGA 



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

ATTAAAGGGA GGCGAACATG GCCCAAGATA TAAAAAATGG AGAAGTAGAA GAAGTTCAAG 
AAGAGGAAGT TGTGGAAACA GCTGAAGAAA CAACTCCTGA AAAGTCTGAG TTGGACTTGG 
CAAATGAACG TGCAGATGAG TTCGAAAACA AATATTCTTC GCGCTCATGC AGAAATGCAA 
AATATCCAAC GCCGTGCCAA TGAAGAACGT CAAAACTTGC AACGTTATCG TAGCCAGGAC 
TTGGCAAAAG CAATCTTACC ATCTCTTGAC AACCTTGAGC GTGCACTTGC AGTTGAAGGT 
TTGACAGATG ATGTGAAGAA GGGCTTGGGG ATGGTGCAAG AAAGCTTGAT TCACGCTTTG 
AAAGAAGAAG GAATTGAAGA AATCGCAGCA GATGGCGAAT TTGACCATAA CTACCATATG 
GCCATCCAAA CTCTCCCAGC AGACGATGAC CACCCAGTAG ATACCATCGC CCAAGTCTTT 
CAAAAAGGCT ACAAACTCCA TGACCGCATC CTACGCCCAG CAATGGTAGT GGTGTATAAC 
TAAGATACAA AGCCCGTAAA AAGCTCGCAG TAAAAATAGG AGATTGACGA AGTGTTCGAT 
GAACACAAGA AAATCTANCT TTTTTACTCA GAGCTTAGGG CGTGTTCGAT TCGGCAATTC 
TGACGGTAG 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 615 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 



480 
540 
576 



60 
120 
18C 
240 
300 
360 
420 
480 
540 
600 
660 
669 
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CTAGTTCACC 


ATTTCTATTT 


AAATTCWjL 1 


CAAC T AC AC C 


GAAATATCAi 


ATAALK* 1 AAA 


o u 


GTTCCCACTA 


AGAATCTCAG 


AATGAAATTC 


TTC ACTAAC T 


TCAGCGGTCA 


TV tv /TYVT"' iv /^r>»rvri 

AALTTAvjL 1 I 


1 z u 


ATCCAATTTC 


ft ft /^o/ - * m m ft m 

AGACCGATAA 


ACCCATACTT 


C AG AGAT AC T 


TCCTCTTGAA 


ATTTAGALj 1 A 


loU 


ACTCTTATTA 


GACATTTGAT 


GTATCGTTAT 


CGGGCTATTA 


/— » TV TV fT^ 7V /"» t**f*r^ 

GAAGGAGCCC 


GATTTAAAAC 


24 U 


TTCTTTATTC 


ATGAAATTTC 


TCCTTCAAAA 


AACAAAACAT 


TCTATATACT 


TAAATTTTAG 


300 


GGCAAGACTG 


CCGTTCCACT 


GACCACGCTC 


C AC TTGCC AG 


TV TV RPOTV^r'^ 

CAAAGCTGGC 


AGGTC C ACjC C 




TT AC CC T AAA 


TATAAATTAT 


TTAATTTTTG 


CCATTTCAAT 


CAATTCTGCA 


AAGGTCCCAA 


420 


GCTTTCTGGA 


TTGAATTACC 


TCATATTGGG 


CATTCTGTCC 


AAATGAAGCA 


TTATTAATCA 


480 


CTTCAACCTC 


ATAAAGGCCT 


GGAGCAGAGA 


AAACAGCTCC 


AAATTTGGAT 


TTATCGAAAT 


540 


GCATACCTAG 


GTCACTCCAA 


CAATCATTAA 


GGGTGGAGAC 


TTTTATCCAG 


CCTCGTTCTT 


600 


TCTTAGTAGA 


AATAG 










615 



(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 



CTACTGTAGT 


TGGAACTGTT 


GAAAAACAAG 


GAAAACAAAA 


GAAAGTTGTT 


ACTTACAAGT 


60 


ACAAACCTAA 


AAAAGGTAGC 


CACCGTAAAC 


AAGGTCACCG 


TC AAC CAT AT 


ACAAAAGTTG 


120 


TCATCAACGC 


AATTAACGCT 


TAATTTTAAG 


GAGAACACAT 


GATACAAGCA 


GTCTTTGAGA 


180 


GAGCCGAAGA 


TGGCGAGCTG 


AGGAGTGCGG 


AAATTACTGG 


ACACGCCGAG 


AGTGGCGAAT 


240 


ACGGCTTAGA 


TGTCGTGTGT 


GCATCGGTTT 


CTACGCTTGC 


CATTAACTTT 


ATCAATTCTA 


300 


TTGAGAAATT 


TGCAGGCTAT 


GAACCAATCC 


TAGAATTAAA 


CGAAGATGAA 


GGTGGCTATC 


360 


TGATGGTTGA 


AATACCAAAA 


GATCTTCCTT 


CACACCAGAG 


AGAAATGACC 


CAGTTATTCT 


420 


TTGAATCATT 


TTTCTTAGGT 


ATGGCAAACT 


TATCGGAGAA 


CTCTTCTGAG 


TTCGTCCAAA 


480 


CCAGAGTTAT 


CACAGAAAAC 


TAACACGGAG 


GAAAACATTA 


TGTTAAAAAT 


GACTCTTAAC 


540 


AACTTGCAAC 


TTTTCGCCCA 


CAAAAAAGGT 


GGAGGTTCTA 


CATCAAACGG 


ACGTGATTCA 


600 


CAAGCGAAAC 


GTCTTGGAGC 


TAAAGCAGCT 


GACGGACAAA 


CTGTAACAGG 


TGGATCAATC 


660 


CTTTACCGTC 


AACGTGGTAC 


ACACATCTAT 


CCAGGTGTAA 


ACGTTGGTCG 


TGGTGGAGAC 


720 


GATACCTTGT 


TCGCTAAAGT 


TGAAGGCGTA 


GTACGCTTTG 


AACGTAAAGG 


ACG 


773 



(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 713 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 



V- i Va A r AGGAT 


TGTGGGCGAT 


60 


i I TAATTCGA 


CCCCGTGAGC 


120 


AAAGCAAGGG 


GGCGTTTGCC 


180 


GCCACATGGA 


CATGGAGCGG 


240 


AAGTCTCTAC 


TGCAGCTATA 


300 


TTTTATATTT 


TAAGATTTCG 


360 


TTTCTGTCTC 


TGAAGAAAAG 


420 


CCACCTGATA 


AATTTGCTGG 


480 


TTGTTCCTGA 


CTGGAGCATC 


540 


GAGTAAACTC 


AGATTCTGCT 


600 


TCATCTCGGA 


TCCCTCTCAG 


660 


GGCATAAGCC 


AAG 


713 



(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

CTAGAAAATC TGCTTTTTTC TCGAAAAGTT TAATAC TTGG TACAATATTA CGCTTGCTTT 
GCTCGGCTGC TCCAAGGGCA ATTTTTTCTA GTTTTTCAAC TTTTTTACCC AATTTCTTGC 
CATCCCACTT GGCAACTGAC CAGTCTGCAG GAAAGGCCCA GATTTGGCTA GCACCCAGTT 
CGGTTACTTT TTGAGCGATG AACTCCAGCT TGTCTCCCTT GGGAAATCCA GATGCGATGG 
TCACTTGGAC TGGTAGTTCC ACATTGTCAT TTAATTCTTG GACCAACTCA 
TTTCCATATC CAGCACGCGC GCCAAGCGCT TGATGCCATC ATCAAAGACT AAGGTAACCT 

catcctcttc tttcaagcgc ataacctgaa acatatgctt actggtttcc ttgtc^cgI 

^ZZ AGG AGAGATAGCA CTGCC ™ CAAAATACTG CTGCATGCTA GCCTCC^C 
A ^ AGA TATCCTTGGT "TCTTAAAG ACACAGGTAT TCCATTCCCC TTGAACCATG 
TGAGTTTCGA GGAAAAATCC AGCTGATTCA GCCGACTCGC GCACCATGTC CCACT^GC 
CTTGAATAAT GCCACTCATG ATCAGGTAGC CTTCGTCCTT AACCAAACGA TAGGcItCGT 
CTATTAGATG AATGAGGATA TCCGCCAAGA TATTAGCCAC AATCACATCT GCCTCAATTT 
CCACACCCTT AAGCAAATCT CCAGCCGCTA CATGGATATT TTCCATGCCA GGG TTGAG^^ 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
778 
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(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1062 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 



CTTCCCAGAT 


TTTTTCGTTG 


ACTTGCTGTT 


CAATCGCACG 


CAATTCTTCA 


GCAGTTACAG 


60 


CTTGGAAGTG 


GGTAAAGTCA 


AAGCGAAGGA 


ATTCAACTTC 


GTTAAGAGAT 


CCTGCCTGTG 


120 


TTGCGTGGTT 


TCCAAGGATA 


TTGTGAAGGG 


CAGCGTGAAG 


CAAATGAGTC 


GCAGTGTGGT 


180 


TTTTCATGAC 


ACGGTGACGG 


CGATTGCTAT 


CAATTGCCAA 


GGTATATTCT 


TGGTTCAAGG 


240 


CAAGCGGTGC 


AAGGACTTCA 


ACTGTATGAA 


GGGCTTGACC 


ATTTGGGGCT 


TTCTGAACAT 


300 


TGGTCACAGT 


AGCCACAACC 


TTAC CTG ACT 


CATCCAAGAT 


TTGTCCGTAG 


TCAGCTACCT 


360 


GTCCACCCAT 


TTCAGCATAA 


AATGACGTTT 


CCCGCAAAGA 


TAAGAGAGGC 


AGTTCCTTCT 


420 


GAAACAGCTT 


CTACTTCTGC 


ATTGTCCGCC 


ACGAATAGCT 


ACCAATTTAG 


AAGACAATTG 


480 


GCTAGCATTG 


TAGTTGAAGG 


C AC TTTCT AC 


AGTGATGTTT 


TGAAGAGTTT 


CCATTTTGCA 


540 


TACCCATTGA 


GCCACCCTTG 


ACAGCTGACG 


CACGCGCGCG 


TTCTTGTTGT 


TCTTTCATGG 


600 


CTGCTTCAAA 


ACCTTCACGG 


TCTACAGTCA 


TACCAGCTTC 


TTCAGCGATT 


TCTTCAGTCA 


660 


ATTCAACTGG 


GAACCCATAA 


GTATCATAGA 


GTTTGAAGAC 


ATCTGAACCA 


GCGATAACAG 


720 


ATTGACCTTT 


TTCTTTCAAG 


TCTGCTACAA 


TGCCTTGGGC 


AAAGTGTTGA 


CCTTGAGTGA 


780 


AGGGTACGGG 


CAAATGATTC 


TTCTTCGCCT 


CTTAACGATT 


TTCTCAATAA 


AGTCACGTTT 


840 


CTCAAGCACT 


TCTGGGTAGT 


AGCTTTCCAT 


G ATTTTTC C A 


ACAGTTGGAA 


CGAGTTTTGT 


900 


AAAGGAAAGG 


CTCGTTGATA 


CCCAATTTTT 


GACCCATGCA 


TAGAAGCACG 


ACGGAGAAGA 


960 


CGACGAAAGG 


ACATAACCAC 


GACCCTCCAT 


TTCCTGGAAG 


GGCACCATCC 


ACCGATGGCA 


1020 


AATGAAAGTG 


AACGGATGTT 


GGTC CGCGAA 


TGAACCTTGA 


AT 




1062 



(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
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CTCAAAGGTC GCACCTGCAA TACGACCCGC TACAGGACCT ACACTTGCTC CATGC TTGGG 
ACTATTGCCT ACATAACTAC CAAAGTCATC AAATCCCAAG ATAACATTGG CAAAATTTCC 120 
AGCCTTGTCA GGTGCGACAT AGCGCAAGAT AGTCGCACCA TAAGTCATAA CCTCAAGTTG 180 
GTAGCCACCG TCTGTCTCAA ATCGATAGGC CAAGACATCC TCACCCTCAA CATTTCCAAA 24 0 

TACACGCTCT GTGTATGCTT TCATTCTGTT CTCCTTTTAC TATTTCTCTC AAGCAAACAA 
ACCATAGAAA GCGTACTGAC AATCTATGGT TTATCTGATA ATTTACAAAT CCTCTTGTCA 
AGAATTCATA AACACTGTCT TACTTTTGAT ATTCGTGAAT TATGACACCT TGTACTACAC 
GGTTTACTGT ACCTGTAGGA GACGGTGTAT CTGGTTTATT TTC TAC CTTG AGTGAAGTCA 
ATAGGGCAAA GAGTTGGGCA TAAACGATGT AAGGGAAGAC ACGGTAAATA TCATTCAAGA 
CACCGCCACA ACCAAGGGCC ACTTCTTTGA CATTTTCAAG ACCAAAAGCT TGATCACTCA 
AAAGCACAAC ACGACGAGCA ATCTGGTCAC CAGCAACTTC ACGAACCAAG TCCAAGTCGT 
ACTTACGAGT GTAGTCCGTC CTTGTACCAA AGAACAAAAC AACTGTATTG TCGTTGATAA 
GATATTTGGA CCGTGACGGA ACCAACTGGG CTTTCATACA TGGTCGCAAC TTGAACAACA 
GTTAATCCCA AATCTTGAGC TGAGCCTCAT GAGCAGTCCA AAGAAGACCA GCGCCTAGAA 
GAG AC CGGTT AAGTCTTAAT CACAG 



(2) INFORMATION FOR SEQ ID NO:101: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 94 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
865 



; GCGTTAGCAG 


GTTTGATACG 


60 


. TGAAATCGGC 


TCTATCCAAC 


120 


' TATTTCCTTC 


CACGTTGATG 


180 


' GACAGAACGG 


GTGGATTGCG 


240 


TGGCTTTGTC 


TATATCAAAT 


300 


GGAGCGAGAT 


TATCGTTCGA 


360 


TCTCCGTTTA 


TCTATGGAAA 


420 


AGTGATTCAC 


CAAGCTCTTC 


480 


CTTTGCACCT 


CATATTCTGA 


540 


CGCCTTTGAA 


GACTATGATA 


600 ' 


AAAACCAGCC 


GGTACCTTGA 


660 


GCGTCTTAGC 


CTAGACTTGG 


720 


AAAATTGATT 


TACAATCAAA 


780 


ATTATGATTC 


GCTACGGAGA 


840 


TAAACTTCGT 


AATAATATTT 


900' 


AGATCG 




946. 
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(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 



CTTGCTCGTA 


ACAGGTGCTA 


TCCTTGGTGT 


CAATGTTCAC 


ATCTTCTCAC 


CAAAAGAACT 


60 


CTTCCCAGAA 


AAAGAAATCG 


TTGAATTGGC 


AGAAGGATTT 


GCTAAAGAAA 


GTGGCGCACA 


120 


TGTTCTCATC 


ACTGAAGATG 


CTGATGAAGC 


AGTTAAAGAT 


GCAGACGTTC 


TTTACACAGA 


180 


CGTTTGGGTA 


TCAATGGGTG 


AAGAAGACAA 


ATTCGCAGAA 


CGTGTAGCTC 


TTCTTAAACC 


240 


TTACCAAGTC 


AATATGGACT 


TAGTTAAAAA 


AGCAGGCAAT 


GAAAACTTGA 


TCTTCCTACA 


300 


CTGCTTGCCA 


GCATTCCACG 


ATACTCACAC 


TGTTTATGGT 


AAAGACGTTG 


CTGAAAAATT 


360 


TGGTGTAGAA 


GAAATGGAAG 


TAACAGACGA 


AGTCTTCCGC 


AGCAAGTACG 


CTCGCCACTT 


420 


CGATCAAGCA 


GAAAACCGTA 


TGCACACTAT 


CAAAGCTGTT 


ATGGCTGCTA 


CACTTGGTAA 


480 


CCTTTATATT 


CCTAAAGTAT 


AATTTTAGAT 


AATAAACCGT 


CTACCAACAG 


CTATGAGGGC 


540 


TGCGACTAAT 


AG 










552 



(2), INFORMATION FOR SEQ ID NO: 103: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103: 



CTAAAAAAAG 


TCAAGTAGAA 


AACGAATATT 


CTACTTAACT 


TGCACGAAAT 


TATTTTTCAC 


60 


GAATGACTTC 


GACCTTATAT 


CCATCAGGGT 


CTTTGACAAA 


GTAATAGTTT 


GGTGCAGTTC 


120 


CTGGTAGACC 


ATTTGGCTCA 


GTCACTTCAT 


AGCCTTTTGT 


ACTGTGCTCT 


TGATGAAGTG 


180 


CCTCAAGATC 


AGGTGT AC TG 


AGGGCGATAT 


GGGCAAACCC 


ATCACCAACC 


ACATACGGAC 


240 


CGTGATCGTA 


GTTATAAGTC 


AACTCCAACT 


CATAGTCATC 


ACCCTCAAGA 


CCTAGATAGA 


300 


CAATCGTGAA 


GGCATGGTCT 


GGAAAATCTC 


TGCGACGCAA 


TTCTTTAAAA 


CCAAAAGCAT 


360 


CTTGATAAAA 


TGCAATTGAT 


TTTTCAAGAT 


TTTCTACTCG 


TAAGCAAGTG 


TGTAGCATTT 


420 


TTGAAGCCAT 


ATCTTTCTCC 


TTT ATTTTT A . 


AAAAGACTGG 


ACAATCCTGT 


TCCAGTCTCA 


480 
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TCAGTTGTTA TTTACCAAGT TTTGCTTTAG CT 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

CTGGACCACA CTATTTTCTG TGTTGGCTAT CGTGTCATGC AAAAGGATCT AGAAGGGACG 
CTGGATGCTG AAAAACTCAA GGCTGCTGGT GTTCCGTTCG GCCCGCTtS TG^^tc 
AAAAACGGCC AGGATCTTGT TTTGGAAGAC GGAACTGAAA TCAAGGCAGC 
TCAGCGCCAC GTCCAGGTAA GATTATCACT ATTTTAGGAG ACACTCGAAA AACGGATGCC 
AGTGTGCGTC TGGCTGTCAA TGCAGATGTC CTAGTTCATG AGTCCACTTA ^GgS 
GATGAAAAAA TTGCTCGTAA CCATGGTCAC TCAACTAATA TGCAAGCTGC ACAAGTAGCG 
GTAGAAGCAG GTGCCAAACG CCTCCTACTC AACCATATCA GTGCCCGTTT CC^Z 

— GCTGCC ACAATTTTTG AAAATGTCCA TGTGGTCAAA 

ATTAACCGGA ™^ ^™ 340 

551 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 529 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

ZlZrZT ACCTTTGTCT A ^AAAATGA AGACCTAGTC TATGAGGAGG AAGTCTTATG 
TTCCTTTTTG S= !!™~ — » CAAGAAGGTC 



512 



TTCCTTTTTG Trrr.r,^ AAAAAGCGGG CAAGAAGGTC 

A !!^!!r GTGTGGCGAT TGTCGTTATA TTTATCCTGC CTTACCAGAG 

™ AGA ^ CACCTTT ATTCGAATGG ACCGAGATCA GtIt^GGAT 
TTGGCCAAAC TCTGGGATGT TTACGGAATT CCTAGCCTTG TTGTTC TAG A AAAP^ 
G = TC GTTTTGTCAA TCGCGACCGT AAAAGTAAGG ££££ CGATTTTTTA 
GCAGGATTGA AATAGGAGAA AAAGGAAACA ATGATTTTTA CGTATAACAA AG^gTc 



60 
120 
180 
240 
300 
360 
420 
480 



60 
120 
180 
240 
300 
360 
420 



121 
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GGTGATGTCC TTATGGTCAT CGTGAAAAAT AGCGGAGATG CCAAACTGAA TGTGGAACGC 
AAAGGCAAGG TAGCCCGTGT TTTCCTCAAA GAAAATGGGG AAACAGTAG 



480 

529 



(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 



CTAGGATAGA . 


CCGCTTTTTA 


GCATTTATCT 


AAGCATTCCA 


GAGTACATGT 


ATCTTGCATG 


60 


TGCTCTTTCT 


TTTGGGGTTG 


AAACGATAGG 


.AGAAGGAAAT 


GTTAGAATTG 


CTTAAATCAA 


120 


TCGATGCTTT 


TGCTTGGGGA 


CCGCCCCTCT 


TGATTTATTG 


GTCGGAACAG 


GGATTTACCT 


180 


AACCATGCGG 


CTAGGACTCT 


TGCAGGTTTT 


GCGTCTGCCC 


AAGGCCTTTC 


AGCTTATTTT 


240 


TATCCAGGAT 


AAGGGACATG 


GTGATGTATC 


CAGTTTTACA 


GCTTCTTGTG 


TAACAGGCCC 


300 


TTGGGCATTC 


AAACTGGTTG 


GGAAACAGGG 


AAATATCCAT 


TAGGGAGTTG 


GCGAC GGCTA 


360 


TCAAGGTTGG 


TGGACCAGGA 


GCTCTATTTT 


GGATGTGGAT 


GGCGGCTTTC 


TTTGGAATGG 


420 


CTACCAAGTA 


TGCGGAAGGA 


CTCTTGGCCA 


TCAAATACCG 


CACCAAGGAC 


GACCATGGTG 


480 


CAGTAGCGGG 


AGGTCCCATG 


CATTATATCC 


TTCTAGGGAT 


GGGAGAAAAG 


TGGCGACCAC 


540 


TTGCTGTTTT 


GTTTGCAGTA 


GCAGGAGTAT 


TGGTTGCTCT 


CTTGGGAATC 


GGAACCTTCA 


600 


CCCAAGTCAA 


CTCGATTACA 


GAATCTATCC 


AAAATACAAC 


GACGATTTCG 


CCAGCCATCA 


660 


CAG 












663 



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 727 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

CCTTGACCAC CAATTCTTCA CCCGATATGA TGCGCCCTGC TCGCTGAGCA AGCCCCAAGA 
GATTACTTAT CTTTTGCTTA TTCAAGTCCC AACTCTCTTC TTTTCACTTT GTGATCCACA 
TAAGCGATCA ACTCGTCATA AAAGCTTTCT TCCACTTCCA TGCTAAAGCT GCGGTTAAAG 
ACCTTCTTCT TTTTCGCCTC TAGGGCTTCT GCATTGTCTA GTTTGATATA AGCGCCGCGG 
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CCATTGGCCT TGCCCGTAGG ATCAATAAAG ACTTGTCCTT CCTTGTTCTT GACAATGCGG 3 00 

AGCAAATCAC GCTTATCAAT CACTTCGTTA GACACAACAG ACTTGCGCAA AGGGATTTTT 3 60 

CTTGTTTTCA TCTTTCCCTC CTCTAGCAGC TTTTATTCTT CTACAGTATC GTTTTCTACT 420 

TCCAACTCTA CTGAAGCAGC GTCTTCCATG GCTTCAAATT CGCTAGCAGA CTTGATATCG 480 

ATACGGTAAC CAGTCAAGTG AGCCGCCAAG CGCACGTTTT GTCCACGACG ACCAATGGCA 540 

AGAGAAAGCT TGTTATCTGG AACAACCACC AAGGCACGTT TGCTGTCGTT TTCATCAAAG 6O0 

ATAACTTGGT CAACCTCAGC AGGAGCGATG GCATTGTAGA TAAATTCAGC TGGATCTGCT 660 
ACCCACTCGA TAACATCGAT ATTTTCTTCG ATTGGTACCA TGCGGTCATT TTTAGCATCG 
TAACGAG 



(2) INFORMATION FOR SEQ ID NO: 108 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 647 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



720 
727 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


108: 






CTAATGTAAA 


TCTATGGGAT 


AAACGTTAGC 


TTCTTTTAAT 


AGATTATTAA 


GCAATTTTTT 


60 


AAAACAACTC 


ATCAAACAAA 


CTCAACTGGT 


TATCTCTGGC 


ATATTTCCAA 


GAATACCCAT 


L20 


CTCATCCATC 


TTTTCAACCA 


AGGTTGATGA 


GAGTCCACCA 


CGCTTGCGTA 


GTTCTGTTTT 


180 


AGAGAGGAAT 


TCTCCCTCTT 


CACGCGCCCG 


CACCAGTTGC 


TTGGCAACGT 


TCTCCCCCAG 


240 


ACCATCCATT 


GCTACAAATG 


GTGGGATAAG 


GGTATCCCCG 


TCGATGAGGA 


ACTCTGTCGT 


300 


CTGACTACAG 


TAGAGATCTA 


ATTTACCAAA 


CTTGAAACCT 


CGTTCCCACA 


TCTCATTGAC 


360 


AATCTCAAGA 


GTTGTATAGA 


GATCGATTTC 


CACATTAGAG 


GCTTCATTGT 


TCTTCCGTTT 


420 


TTCAGAGATT 


TCTTCCATTC 


TGCGCTTGAT 


GACCTCCAAG 


CCCGCACCCA 


TGGTC TTG AT 


480 


ATCAAAAGCC 


TTAGCACGAA 


TGGAGAAGTA 


AGCACAGTAG 


TAATAAATAG 


GATGGTGAAC 


540 


CTTGAAGTAA 


GCTACACGCA 


AGG 








563 



(ii) MOLECULE TYPE: Genomic DNA 
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(xi) 



SEQUENCE DESCRIPTION: 



SEQ ID NO: 109: 



PCT/US97MS306 



CTAACTTCAA 


AAGCTTTTAG 


AATATCTTCT 


TGACTAATCA 


TATGATTACC 


GCCCAAGGTT 


6 0 


AAGTTTTCCA 


AAATAGAGCC 


ATTAAAGATA 


TAGGCTTGTT 


GGGGTAGGTA 


ATTAATATGA 


X yJ 


CGGCGCAAGA 


CTTTTTTTAT 


CAATGTTTTT 


AATATCCTGA 


TGATTGATGG 


AAATATGCCC 


180 


TTTGTAGGGT 


TCAAAGAAAT 


TGACAATCAT 


TTTGGCTAAA 




CAGAACCACT 




AAL 1 LLAAL i 


AviVi L- 1 AAL L, 1 


lAlV. ILL i i Li 


111 AA 1 L.Lj 1 Lj 


ALr A 1 I AA 1 A 1 


CTGTTAAGGT 


3 00 


ATCTCGTCCA 


AAACCATACT 


TATAAGAAAG 


GTCATCAAAT 


TCAATATCGC 


CCATCAAAAA . 


360 


ATGTGAATGA 


ACAGGGTTTT 


CTTGAACTTG 


AAATTCAGAT 


TCGACTAGAT 


AGACTTCGTT 


420 


CAAACGGTTA 


TTAGCGACCT 


TCGCAGATTG 


GAGTTTGGTT 


TGGAGGTTGA 


TAATATTTTC 


480 


CATAGGAGTT 


GTAAAGT AAG 


AAAGAAGTGT 


GTTAAAGGTA 


ATCAGCTGAC 


CGATAGAAAT 


540 


TTTACTCGAC 


ATGACTAATT 


GAGCGCCAAA 


CCATAGGATA 


AGGATATTCA 


GAACTAATTT 


600 


TGTTTCCCCT 


GCTTTAAACT 


CGTTTGTAAA 


ATAGAATATT 


TACTGAG 




647 



(2) INFORMATION FOR SEQ ID NO: 110: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 610 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110 : 



CTATCCTNGA 


ACCAATGATG 


CTTGTAACAA 


TCACTGTTCC 


AGAAGAAAAC 


CTTGGTGATG 


60 


TTATGGGTCA 


CGTAACTGCT 


CGTCGTGGAC 


GTGTAGATGG 


TATGGAAGCA 


CACGGTAACA 


120 


GCCAAATCGT 


TCGTGCTTAC 


GTTCCACTTG 


CTGAAATGTT 


CGGTTACGCA 


ACAGTTCTTC 


180 


GTTCTGCATC 


TCAAGGACGT 


GGTACATTCA 


TGATGGTATT 


TGACCACTAC 


GAAGATGTAC 


240 


CTAAGTCAGT 


ACAAGAAGAA 


ATTATTAAGA 


AAAATAAAGG 


TGAAGACTAA 


TCCGTCCTCA 


300 


CTCTAGAAGG 


AAGTCACTTA 


GTGGCTTCCT 


TTTGTC TTTA 


GAAAATACCT 


CTAAATATGG 


360 


TAAAATAGTA 


GAAGAATAAT 


GTGAGGAAAA 


TGAATGTCAA 


ATAGTTTTGA 


AATTTTGATG 


420 


AATCAATTGG 


GGATGCCTGC 


TGAAATGAGA 


CAGGCTCCTG 


CTTTAGCACA 


GGCTAATATT 


480 


GAGCGAGTTG 


TGGTTCATAA 


AATTAGTAAG 


GTATGGGAGT 


TTCATTTCCG 


TATTTTCTAA 


540 


TATTTACCTA 


TTGAAATCCT 


TTTTACAATT 


AAAGAAAGTT 


TGAGCGAAGA 


ATTTCCTNAG 


600 


AANGC CATC A 












610 



(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 984 base pairs 

(B) TYPE: nucleic acid 

124 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

TATAAGCATA GTACCAGTTG CCATTGACCT GGAACCAACT GTCTTCATAT CTCC ATTAC T 
GCATTTAGGT AGTACCAAGT TGAACCATCT TGATACCAAC CAGTTGCCAT TGCTCCTGAT 
GAACGGAGAT AGTACCATTT GTTCCCAAGG TTTTGCCAAC CTGTTTTCAT ATCGCCATTT 
GGCTGGTCTA AATAATACCA AGTGGTACCT TCCTGATACC AACCAGTTGC CATTGCTCCT 
GATGAACGGA GATAGTACCA TTTGTTCCCA AGGTTTTGCC AACCTGTTTT CATATCGCCA 
TTTGGCTGGT CTAAATAATA CCAAGTGGTA CCTTCCTGAT ACCAGC C AGT GGCC ATTGC T 
CCTGAGGAAC GGAGGTAGTA CCACTTATTA CCTAGATATT GCCAACCTGT TTGCATAATA 
CCAGTTGTTG GATCTAAATA GTACCAAGTC AAATCATCGT TTATCCACCC CGCACGTCTT 
TCACCACCAA GGTAGTTTTC TCCATTAATT TCCGTCTTAG CTAGATAATA CC AG TAG AC T 
GATCATAAAG CCAACCTGTC TCTAAAGAAT GATTTTGATT AAAGTAATAG TTCGTATAA- 
AACGNTTCTC TTCTTTATCT TCTGAATCTT CACGTTTTTC CCCGTACTTT CTTCCAACAC 
TGTCTTTAGT TTTAATCTCT AATGTTTTCC AAC C AAC AAA CTCTTGTAGC ACTCCATTTT 
TATCGAAGTA GTACCACTCT GAATTTGGAA AAC CTTCTAA TCTCATACCA TTTGGGTAAG 
GACCAATTGT ACTACCTTTA GATGGAAACG GGATATATTG CCAGCCGACA ACCATCTCTC 
CAGATAGAGA ATCAAAATAA TAGTACTTAC CATCAATCAC TCGCCAGTAG GTTTCTTTGA 
GGTCCCCCTT TTTGTAATAG GTTCTACCGT TTTCTTGGAC AAACTGCCAT CCTTCAGAGT 
TATCTGCAAA TACTGTATTC GTAG 

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 842 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
984 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ I D NO : 1 1 2 : 

CCTCGTTATG CAGATGAACG TTATTTCTTG TCAAAGAGTC ACAAGAATTT TGTTGATCGT 
AA^CTTTTTA TTACCATTCG TGACAAGGAA ACCACC^TA TCAAGCCTTA TCAGCAGGAT 
TTGGATTTGC CACATGGTCT GGCCTTGGAT GTTTTGCCTT TGGATTATTA TCCGAAAAAT 
CCAGCTGAGC GGAAAAAACN GGTTCGTTGA GCCTTGATTT ATTCACTCTT TTG^^ 
ACTATTCCAG AAAAGCATGG TGCTCTCATG AAATGGGGAA GTCGCATTTT ACTGGG™ 

125 



60 
120 
180 
240 
300 



i 
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ACTCCAAAAT 


CTCTCCGTTA 


TCGLA 1 v- x \jO 


a a a a a arrTn 


An A A An A A AT* 


n AnT* A AnT 1 AT 




GATTTGGCTG 


ATTGTGATGO 


TV H"«H"< TV ^ 7A P* A A 


A TPrTP a n 


vjILLI VjvjV_ 1 J\ 


n a Tn ap* a a An 




AAGTACCCAA 


TCACATCTTT 


TGAAbALAA 1 


P* ' 1 "1 " 1 ' r i i P*T7 

t 1 1 i ILi IvjL. 


n A r r r r r rn a Ann* 


a An An An a Tn 


4 fin 

4 O VJ 


CCTATTCCAA 


TCGGC TATG A 


TGl V. 1 A 1 I v~ 


An a anY!PTT 


r PT*nnnnA p n r rA 


t ATn arnprT 


j4U 


CCACCAGCAG 


ACAAGCAGG 1 


a nnnn a Tr* a n 


Un X v7V_ io I ^fl 


TCCZHTCZ AT AT 1 


nn at a AnTnT 


rnn 

O v VJ 


TATACAGAAT 


ACAAGGGAGA 


AT ATGGTGG C 


I AAGAAAAAA 


ATCTTATTTT 


TTATGTGGTl 


boU 


TTTTTCTCTT 


GGAGGTGGTG 


CAGAGAAGAT 


TCTATCAACC 


ATTGTTTCAA 


ATCTGGATCC 


720 


AGAAAAGTAT 


GATATTGATA 


TTNTTGAAAT 


GGAGCACTTT 


GACAAGGGAT 


ATGAATCTGT 


780 


TCCAAAGCAT 


GTACGCATTT 


TAAAATCCCT 


TCAAGATTAT 


CGCCAAACCA 


GATGG AT AC G 


840 


AG 












842 



(2) INFORMATION FOR SEQ ID NO: 113: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 749 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 



CTGGCAAATA 


CAAGGTGACG 


ATC ATTGGTA 


AATCAGCCCA 


CGGTGCTATG 


CCTGCTTCAG 


60 


GTGTCAATGG 


TGCGACTTAC 


CTAGCCCTCT 


TCCTTAGCCA 


GTTTGACTTT 


GCTGGTC C AG 


120 


C C AAAGAATA 


CCTTGACATC 


ACTGGTAAAA 


TTCTCTTGAA 


CGACCATGAG 


GGTGAAAGTC 


180 


TCAAGATTGC 


TCATGTGGAT 


GAAAAGATGG 


GTGCCCTTTC 


TATGAATGCA 


GGCGTCTTCC 


240 


GCTTCGATGA 


AACAAGTGCT 


GATAATACCA 


TTGCCCTCAA 


CATCCGCTAT 


CCAAAAGGAA 


300 


CAAGTCCAGA 


ACAAATCAAG 


TCAATCCTTG 


AAAACTTGCC 


AGTTGTTTCT 


GTTAGCCTGT 


360 


CTGAACACGG 


TCACACGCCT 


CACTATGTTG 


CCAATGGAAG 


AATCCACTTG 


GTTGCAAACC 


420 


TGGTTGAAAT 


GTCTATGAAA 


AACAGACAGG 


CCTTAAAGGT 


CATGAACAAG 


TCATCGGTGG 


480 


TGGAACCTTT 


GGTCGCTTGT 


TAGAGCGCGG 


AGTTGCCTAT 


GGTGCTATGT 


TCCCAGACTC 


540 


AATTGAT AC C 


ATGCACCAAG 


CCAATGAATT 


TATTGCCTTG 


GATGATCTCT 


TCCGAGCAGC 


600 


AGCAATTTAT 


GCCGAAGCTA 


TTTACGAATT 


GATCAAATAA 


AACGATAGAA 


GTCTGAGATC 


660 


TTATGCTTGG 


ACTTCTTTTT 


GGAGGGAAAG 


TAGATGTCTC 


AAATCGAAAG 


AATCAAACAG 


720 


GCTATCATGG 


CGGATTCACA 


GAATGCCAG 








749 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



WO 97/37026 



PCT/US97/QS30<S 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114 



ac^TrT GAAGCG CAACCTAC ^ GGATTTGTCT TCCAAGATTT TCAACTATTT 

CCTCATCTAT CAGTTCTGGA AAATTTGACT TTATCGCCTG TGAAGACCAT C^lZlr 
CAGGAAGAGG CTGAGAAGAA GGCGAGTGGA CTCTTGGAAC AGTTAGGACT ^T^r 
GCAGAGTCCT ATCCTTTCTC ACTATCTGGT GGGCAAAAGC ACCCCG^c t"^ 
GCTATGATGA TTGACCCAGA AATCATTGGC TACGATGAAC CAACTTCTGC C™TCcI 

gaatt acgtt tggaagtgga gaagctaatc ttgcaaaata gggaacScS gat^cccaJ 

ATTGTGGTTA CCCATGA-r™ rr,^ UGG GATG ACC C AC 



60 
120 
180 
240 
300 

ATTGTGGTTA CCCATGATTT GCAGTTTGGC tgaaZT^T UUUMCTTOa ^ACCCAC 360 

TAGAACCTCA AATAGGAAGA AAAATGGATT GAAAAAATGG ATGCTTGTAT "° 
GAAGACTGCC TTTGTTCTTA GTA AAAAAATGG ATGCTTGTAT TAGTCAGTCT 480 

503 



(2) INFORMATION FOR SEQ ID NO : 115: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 615 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115 



CTGGAGGGCA TTCAGTATTA CTTGAATAGA GGAAAACATC ATCTCCCTTA TGTTCTCAAT 
ATTGGATTTC CTGGTCAGAA AAATGACCTC TTACTCCTTC GGCTArZZ TGTTCTCAAT 
TCAATCTCTA CTr^rvr*nr. ^ ^AUCCTTC GGCTAGATTT AGCTGGAATT 

=i™ = =~ =E = =s 

TAACTATGAA CTGACTCGCC ^OO^l TrZ TTTTTTGAAA CTAAGGTTGG 

s= — SEE — — 

CTTCCTCATG gItAG AGACCACCAT ATTCATCAGG AAAAATTCTA 



(2) INFORMATION FOR SEQ ID NO: 116 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 954 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



PCT/US97/OS306 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

CTGCAATGGT TAGCGAGGAT GGTGAAATAG AGGAGCCTTA ACTATTTTGA GGAATCCAAG 60 

CTGACAGGAC TGGTATGTCT GCTCTTTGGC TCTGAAGGCA GACTATACCA GCTGGAAGTT 120 

TGACAGATGT GGCACGTGAG AATTATTGGA GGCNTTGACN TANTCAGAGG ATTCTCTCTT 180 

GATATTGGGT TTGGCAGAGT TGGATAGTGA GTTGGAAAAT TACCAAGCGG TTATTCAAGC 240 

CTATGCCCAG TTAGATAATC GCTCGATTTA TGAGCAAACG GGCATTTCCA CCTATCAACG 3 00 
AATTGGCTTT GCCTATGCTC AGTTAGGGAA ATTTGAAACG GCTACTGAGT TTTTAGAAAA . 3 60 

AGCCCTGGAG TTAGAATACG ATGACTTAAC AGCTTTTGAG TTGGCCAGTC TTTATTTTGA 420 

TCAAGAAGAA TATCAAAAAG CCACCCTCTA CTTTAAGCAG CTTGATACCA TTTCTCCTGA 480 

CTTTGAAGGC TATGAGTATG GGTACAGTCA GGCTTTACAT AAGGAACATC AAGTTCAAGA 54 0 

AGCCCTGCGT ATCGCTAAGC AAGGATTAGA GAAAAATCCC TTTGAAACTC GCCTCTTGCT 600 

AGCTGCTTCA CAATTTTCTT ATGAATTGCA TGATGCTAGT GGTGCAGAAA ATTATCTCCT 6 60 

TACTGCAAAA GAAGACGCTG AGGATACAGA AGAAATCTTG CTTCGTTTAG CCACTATTTA 72 0 

TCTGGAGCAG GAGCGTTATG AGGATATTCT AG AC TTGC AG AGTGAGGAGC CAGAAAATCT 78 0 

TTTGACCAAG TGGATGATTG CTCGTTCTTA TCAAGAAATG GACGATTTGG ATACTGCTTA 84 0 

TGAGCATTAT CAAGAGTTGA CAGGAGATTT GAAGGACAAT CCAGAATTTC TGGAACACTA 900 

TATCTATCTC TTGCGTTGAA TTGGGACATT TTGAAGAAGC AAAATCCCAT GCTC 954 

(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 



CTATGAAACA 


TTTTGATACT 


ATTGTCATCG 


GTGGGGGACC 


TGCTGGTATG 


ATGGCT AC G A 


60 


TTTCCAGTAG 


CTTTTATGGA 


CAGAAAACCC 


TCCTCATCGA 


AAAAAATCGG 


AAACTTGGAA 


120 


AAAAATTAGC 


TGGGACTGGT 


GGGGGACGTT 


GCAATGTGAC 


CAACAATGGT 


AGCTTAGACA 


180 


AC CTGCTAGC 


TGGAATTCCT 


GGAAACGGAC 


GCTTTCTTT A 


CAGTGTTTTC 


TCCCAGTTCG 


240 


ATAATCATGA 


CATCATCAAC 


TTTTTTACAG 


AAAATGGTGT 


TAAACTTAAG 


GTCGAAG AC C 


300 


ACGGACGCGT 


CTTTCCAGCC 


AGTGACAAGT 


CTCGGACTAT 


TATCGAAGCT 


TTGGAAAAGA 


360 


AAATCACTGA 


ACTAGGTGGT 


CAAGTTGCTA 


CTCCAAATAG 


AAATCGTTTC 


TGTTAAAAAA 


420. 


GTAGATGACC 


AGTTTGTCCT 


TAAGTCAGCG GATCAAACCT 

128 


TC AC TTGTG A 


GAAACTCATT 


480 
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GTCACAACAG GTGGTAAGTC TTATCCTTCG ACTGGTTCGA CTGGTTTTGG TCACAATGCT 540 
CCGCCATTTA AGCATACCAT CACG . 



(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 651 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 118; 



(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 602 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119 



CTTAGTATCT AGAAAAGGAG AAATAAAATG GTTAAAGTAT TAGCAGCGTG CGGAAATGGA 
ATGGGTTCAT CAATGGTTAT CAAGATGAAG GTTGAAAATG CTCTCCGTAA GCTTAATCAA 
ACAGATTTTA CAGTCAATTC ATGCAGTGTC GGTGAAGCTA AAGGTTTAGC AGTAGGATAT 
GACATCGTAA TCGCTTCTCT TCATTTGATT CAAGAATTGG AAGGGCGAAC TAATGGGAAG 
TTAATTGGGC TTCATAACTT GATGGAPGAT AAAGAAATCA CCGAAAAACT CAGTCAAGCA 



564 



GCAAGTCCAT 


TTTATCAGCA 


60 


CAATCACTTC 


TACTTCAGGA 


120 


GGGTACGAGT 


TTCTGGATAG 


180 


TACCAGATGC 


CCAAGTGAAG 


240 


GGTGGCTAGC 


GATATCTTTA 


300 


TTCTTTAATT 


CCAGTCCTGT 


360 


CCTCAGCTTG 


GGTAATGGGA 


420 


CAGCAGGTGT 


CATGACTCGT 


480 


GTGTCC CG AC 


AGATAAAAAT 


540 


GAGCCGAGCC 


AAACCAACAC 


600 


GTTGAATC C A 


C 


651 
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ATACAGTAAA AGGTTGGAGG GGGCTGGACA CAAACTGAGA GTTATCGTTT CTGTCCTTCT 
CCCTCTTTAA ATAAAGGAGG CAGATATGAA TTTAAAACAA GCTTTAATTG ACAACGACTC 
GATCCGACTA GGTTTAGAAG CTAACGAATG GAAAGAAGCA GTCAAGGTAG CAGTAGATCC 
CTTGATTGAA AGTGGGGCAA TTTTGCCAGA GTATTACGAT GCTATCATTG AATCGACTGA 
AGAGTATGGG CCTTACTATA TCTTGATGCC AGGTATGGCT ATGCCCCACG CTAGACCTGA 



480 



420 



360 



600 



540 



AG 



602 



(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 586 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ing 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 0: 



ATTGGGAATT 


TTTGCAACTG 


CAATTGGTGC 


CCTCAGTAAT 


CTATAAAATA 


GATTCAAGAA 


60 


AATTTAGTGA 


CTGGGATTTC 


CCAGCCCTTT 


TTTAAAGTGA 


GAAGAAATAA 


TGAGTATGTT 


120 


TTTAGATACA 


GCTAAGATTA 


AGGTCAAGGC 


TGGTAATGGT 


GGCGATGGTA 


TGGTTGCCTT 


180 


TCGTCGTGAA 


AAATATGTCC 


CTAATGGAGG 


CCCTTGGGGT 


GGTGATGGTG 


GTCGTGGAGG 


240 


CAATGTGGTC 


TTCGTTGTAG 


ACGAAGGACT 


ACGTACCTTG 


ATGGATTTCC 


GCTACAATCG 


300 


TCATTTCAAG 


GCTGATTCTG 


GTGAAAAAGG 


GATGACCAAA 


GGGATGCATG 


GTC GTGGTGC 


360 


TGAGGACCTT 


AGAGTTCGAG 


TACCACAAGG 


TACGACTGTT 


CGTGATGCGG 


AGACTGGCAA 


420 


GGTTTTAACA 


GATTTGATTG 


AACATGGGCA 


AGAATTTATC 


GTTGCCCACG 


GTGGTCCGTG 


480 


GTGGACGTGG 


AAATATTCGT 


TTCGCGACAC 


CAAAAAATCC 


TGCACCGGAA 


ATCTCTGAAA 


540 


ATGGAGAACC 


AGGTTCAGGA 


ACGTGAGTTT 


ACAATTGGAA 


CTTAAA 




586 



(2) INFORMATION . FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 680 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121; 



TTCGACAATC TTGCCCNNGA TCCAAGACAA CCACTTCTCT GTCCGCTCAG CAATAGTCAA 
GCGGTGAGCA ATGAAAATCA AAGGTCTTGT CCAAAGCCAT GAGATAATGG ACAATCCGTT 

130 
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TTTTTGTCAA AATATCCAAA CTGTTAGTCG CCTCATCCAA AATCAAGACC GGCGCATCTG 
TCAAGAGAGC ACGCGCCAAA GCGATTCTCT GACGTTGACC ACCTGAAATC CCTGCCCCAT 
CCGAAGTCAA TTCTGTCTGG TAAATTTCAA GGTGGCATGG CGGCTTCGAG ATATTCCTCT 
TCGGAATCTC TGAACCAAAT TCGGACCGGC CCGTTAAAGA TATCTTCCCT GTGTCGTCCC 
Za^ZT TCCCAAAAGA AGATTC — AAATCGTTCC GTTAAAGACA TA^CTcS 
GAGGCAGATA GATGATGTAC TGGCGTAGGG CTTTTTTATC AATCTGATTG AGATTGACAC 
CACCCAGACT AATCTCCCCT TGACTTGGGT CGTAAAAATT AACCATCATC ^CCA^G 
TCGTCTTACC TGACCCTGAA ATCCCCACAA AAGCCACCTT AGACCCTTGG GGAACGGTTA 
AATTGATATC f-nn™ r,^.^ m WjAACGGTTA 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

CCAATTTGGA TTTGCGAGAA GGCGTTATTA TTTCAGATGC TGGTTCGACC AAGTCAACCA 
mmXM TATTTGGCTG GCAAGTCTGT TCGCTTTGTC G^CC^C 
CCATGGCTGG TAGTCACAAG ACAGGGGCTG CTTCGGCAGA TGTCAATCTT TTTGAAAATG 
CCTATTATAT CTTTACAACC TTCCAAAGCC CTGAACAAGT CAAGGACAAC GCTTAAAGGA 
AATGGGAAAG GATCTGCTTT CCAGGTCTTC ATGGCTCGTT TTATCGAGAT £££££ 
CAGCATGATC GTGTCACTTC TCAGATTAGC CATTTTCCTC ATATTTTGGC T^T^C 
ATGGAGCAGA CTGCGGTCTA TGCTCAAGAG CATGAGAATG GCAAGGCGCT T^CcSaGG 

"ZZ CGA GATATGACCC GAATTGCGGA AAGCGAGCCA GGAATGTGGA CCTC^CT 
ZZZ AGCGAGACCA TTCTGGATAG AATTGAGGAT TTCAAGGAAC GT^Hgc 

GATTGGTCAG GCCA^TAGTA AGGGAGATGA AGAGCAAATT TGGAACTTTT TTAACCAAGC 

(2) INFORMATION FOR SEQ ID NO: 12 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 



C TGGCTC ATC 


AGGGGCAGGA 


TATTTATGAT 


TTCCCTCGTG 


CTATGATTAA 


GGAAGATAAT 


60 


C TGGAGTTTT 


C A i 1L. 1U 1 vjVj 


111 urtiitt 1L 1 


Vj^-V-. 1 1 1 Alv-A 






X u 


CAAAAGGGAG 


AAAGTCTGTC 


TACAGAGGAT 


TTGTGTGCTT 


CCTTCCAAGC 


AGCTGTACTG 


180 


GATATTCTCA 


TGGCAAAAAC 


CAAGAAGGCT 


TTGGAGAAAT 


ATCCTGTTAA 


AACCCTGGTT 


240 


GTGGCAGGTG 


GTGTGGCAGC 


CAATAAAGGT 


CTCAGAGAAC 


GCCTAGCAGC 


CGAGGTTACA 


300 


GATGTCAAGG 


TCATCATTCC 


ACCTCTGCGC 


CTCTGCGGAG 


ACAATGCAGG 


TATGATTGCT 


360 


TATGCCAGTG 


TCAGCGAGTG 


GAACAAAGAA 


AACTTTGCAA 


ACTTGGACCT 


CAATGCCAAA 


420 


CCAAGCCTCG 


CTTTTGATAC 


CATGGAATAA 


AGAGTGGCTC 


TTTGTCAAGT 


GTAGTGGGTA 


480 


GGCGAAAAGC 


TACAATCTGG 


AGANTACGAA 


ATTC 






514 



(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 4: 



CTTGGATTAT 


AAAATACAAT 


CGGCAAGATC 


ATAAGTCCCA 


AGCCTAAAAA 


TATATAGAAA 


60 


GGGGGTC AC C 


TTCCAAAGAA 


ATTCTGTATT 


AAAGAGCATG 


ACCACAAAAC 


CAATCACAAG 


120 


CCCCAAGGCA 


ATCCAGGCGA 


CCTGCTGCCC 


TAAAATGGGC 


AGAATATTAT 


TGGGGTAATC 


180 


ATGACTAACG 


GCTATATAGA 


TAGCCACCAC 


ACCGATGACC 


AGTAGAAAAA 


ATACTGGCAA 


240 


GAGCAAACTG 


TAATCGACTC 


TTGAGTCGAG 


AGAACGTTTC 


ATATAAACTA 


AC C TT AT AC T 


300 


TTCATACAAT 


ACTATTTATC 


AAAGTTCATT 


AAAAAATCTA 


TCAATAGCCT 


CGTCAACTTC 


360 


GGATCGAGAG 


ATGGTTTTAA 


CAGTCGCTTC 


TTCTGCTAGA 


GATGCTACTA 


TTTGTTTGCC 


420 


GTATCGTTTT 


CCGACGATTC 


TCCTATCCAA 


AATAAGAGTT 


AAGGAACGTT 


GGTATTCACG 


480 


TCTCATACTT 


CTTCCCAAAG 


CCTGTTTTAA 


ACGAATAATG 


G 




521 



(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

132 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 5: 

ZZrT^ GACATCATTC *=TCTTTTTG GTATATCATC GACCATTTCT TAAAAAATGT 
CC^r GAAGAAGAAC — ^ CA ATTGCTTAAT AACCAAGGAA A^^ 

ccAcrrrTCA agtcaacacc tccctacagc cattgatttt gactttaacc atcctt^gI 

CCCTCGTTAT CCCCCAAGAG TACTGGTTTT AGACATGGAC GGTAGAGAAA CTATCC^CT 
AGAGTTTTTP — AAACT CTAGCCTTCA CTTCCAAC^ ZZ^cl 
IZrlZl TA "" TTTTCA ^ATCATA CAAGTTGCGG ATCGGTTGTT TTAATATCGG 
ATGGATAAAA TGAGGCGCAA TTTCCTGTAA GGACTCAAGG ACAAAAAGGC GTTCCCGCTA 
TGTTAAGGGA TGAGGCAATA TJGAGGTCGTC TGTATAAAGG ATCTGGTCCT cSc^ 
CCAAGTCCCA AATCAATCCA AACGAAGTCC CCCAATGCAC TTCTCtScc CG^C^ 
TCTTCACTCC AATGGCTAAC AAGGTTTCNT AACAATTCTT GTTGCTGGTA GCC^C 

(2) INFORMATION FOR SEQ ID NO: 12 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 



CTATGGGTAA AAAGGATGCC TCTGCTATrr htp,^^,^ 

^ 1A1(jC ATGAGATGAG GGCTTCrTTT 
CCATAGAAGC AGGCCATACT GCGTAW!^ ^ ^CTTCCTTT ATTCAAGGCT 

~ =: EE EF = = 

=s s= ssS BP = ~ 

TTAAGGGGCT CCAGCAAGGA ACAAGTCTAT CTATATAGGT TTGAAATCGA 

— ™^ ATTACCTGAG 

tt^^ gttggtcttt tcoattcatt "HZ^ r™"" 

CTTAGCTATC TATTGAATTG TGAA ^^AAAT CGTCCAAAAG TATTTAATAA 



(2) INFORMATION FOR SEQ ID N0:127 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 618 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOFOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 127: 



ACTAATCACT 


TACTACGTAG 


CTTCTGACTA 


TCTACCTCTA 


CTAGATAAAG 


GACAAACTGT 


60 


AAGATTAAAA 


CTGGAGAAGN 


TTGG AAATC A 


CGGCATTACC 


ACCATCGGCC 


AACTTCAGAC 


120 


AATTGATCAA 


ACTCCTACCA 


GAACAGAGCA 


AGGCAATCTC 


TTTAAATTAA 


CCGCTCTTGC 


180 


AAAACTATCT 


AATGAGGATA 


GTAAACTCAT 


CCAATATGGC 


TTACAAGGTC 


GCGTCACTAG 


240 


TGTAACTGCA 


AAGAAAACAT 


ATTTTGATTA 


TTTCAAAGAT 


AAAATTTTAA 


CCCATTCTGA 


300 


TTAATTTTCA 


GATAACACTC 


TATAACTATT 


TATTATCTTA 


TCAAAAAGGA 


GAATCATAAC 


360 


ATGGATAAGA 


AACAAAACCT 


AACTTCATTT 


CAAGAACTAA 


CAACTACCGA 


ACTCAACCAA 


420 


ATTACAGGTG 


GAGAATGGTG 


GGAAGAACTC 


TTACATGAAA 


CAATTTTAAG 


TAAATTTAAA 


480 


ATCACAAAAG 


CACTTGAACT 


ACCTATTCAG 


CTATAAAAAC 


AAGACCGAGA 


AACAAGAACT 


540 


CTCGGTCTTG 


TTTTTTATCA 


TTCTGCATGT 


ATCACAGTAA 


GTACCTGACG 


AAAGACTTGA 


600 


TTTTGGCGAG 


GTAGTATT 










618 



(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 8: 



60 



CAGGGGATGC CAGTTGATTT GCTTCAAGTT GACTTGACTC GTACTTGGGA AATCC TCGG A 

GAAATCACTG GGGATGCTGC TCCAGATGAA CTCATCACCC AACTCTTTAG CC AATTCTGT 120 

TTAGGAAAAT AAGAAAAATC CATGATCCTT CATTCGGTCA TGGATTTTAT TGTCTTTATT 180 

AGTAATCTGG TCTTAAGACC C CTGTTAC AG TTGCCTTAGT TGCTTCGTAG TCGCCATCTA 240 

CGACAACCTT GATAATGCGT TTGACATCTT CTTCTGGTGC TGGAACAAGA GGTAGACGAG 3 00 

TGGGTCCAGC TTCAAATCCC ATATAGTTAA GAATTGCCTT AACTGGAGCA GGACTTGGAT 3 60 

AAGAGAAGAG AGCATTAACC TTAGGAATGA ATTTACGCTG AATTGCTGCG GCTTTCTTCA 42 0 

TATCGCTTTC TGCAATGGCA GTAAACATCT CGTGCATTTC ATCCCCATTT GTATGAGAAG 480 

CAACAGAAAT AACCCATCCG CCCCAAGGTT CATGGCATGG AAAGCATCTC CATCCTCACC 540 
TGTATAAAT 

(2) INFORMATION FOR SEQ ID NO: 129: 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 455 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION : SEQ ID MO: 129: 



AACACCGACA 


GAGATAGAAG 


CAATGAGCCC 


60 


GCGTCCTGCC 


TCATCAAAAC 


GGTGCATAAG 


120 


CTGCTACATT 


TTCATCCAAA 


ATCGTCTGCA 


180 


TGTGATTTCC 


TAAAGTGACA 


CTTGTATCCG 


240 


TTGGAATCAA 


TTGAATATCT 


TCGTAATCAA 


300 


CTCGTTTCTT 


TTGTAATGAC 


CTACCTATGC 


360 


CCCTTAAATT 


TATTATAAAC 


CAAAAGTACA 


420 


ATATC 






455 



(2) INFORMATION FOR SEQ ID NO: 130 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 776 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 



CTCCTGGGTT TTGATTAACT CTAGGGTTGC CAAAAAGAGG GTGATGACTT CTTGGACATT 

caca^taItc TTCAACAAAT CCTGCAAGCG CAATTCATTT ™™ -™ 

CACAATAATC ATCATGTCCT CAATCTTATA CTCATCCCGC AAGATAGTCG TGTGATTTTG 
TGCAAACTCC TCTTTTTTCT TGGCTAGGAT ATTTGAAAAA GCCAAAAAGA GG^T^T 
CGTCTTGTCA TGCACAAGCT CCGCATCTTC GTAAATCAAC TCTATCGGCG c7^Z 
T^T CC CGTrTCTTGG ^CTTGGCTT CCAAGTGCTC ACCCAAGAGC 7^Z C 

r= ~ — ~ = ~i 
~ = = ~ — £= 

~ = = ~ — EE 

GTTTTCTATT TGTCCTTAGA ATAAACTGTT CACGTAAGAC TTGAGATCGA A^TCCT 



(2) INFORMATION FOR SEQ ID NO: 131; 

135 



6 0 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
776 



BNS0OCID: <WO_ L _9737026A1_I_> 



WO 97/37026 



PCT7US97/05306 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 



ATTGATACCN 


ATTGTACAGG 


AC AACTT ATT 


TTGGAGCGCC 


ATAGCAGGGA 


TANGGTTAGC 


60 


AATAGATGCG 


NTGACCATAG 


AAACGTTTNT 


TAGCCAATAC 


AT AAC CTGCT 


AGAGATGAGG 


120 


TTGCACAAAC 


TAAGAACATG 


GTTACCAATG 


AGATAAATAC 


TGAGTTC C AC 


ATCCATTGCA 


180 


AGGCAGGGTT 


CTGCACCATG 


AGTTGTTGGA 


AGTTTTCCAT 


GGTTGGCATT 


TTAGGGAACC 


240 


ACTGAGGAGG 


AATAACAATT 


GTATCAGGTT 


GTGATTTGAA 


TGCCCCTGTC 


AAAATCCAGT 


300 


AGAATGGAAA 


GATGNACAGC 


ACAGTCAACA 


AG AGC AAAAT 


GATTGTTGAA 


ATNACAGTAA 


360 


AGGCTGTTAA 


TGGTTTTTTT 


TCTGTAGATT 


GCATAGCTGT 


CTCCTTTCTT 


TAGTATTCTA 


420 


CGTCGTTTCC 


AAGTACTTTA 


AATTGAACAA 


AGCTTACGAT 


AGCAATCATG 


ACTGCCAAGA 


480 


AG AC AC C AAT 


TGTGTTGGCA 


TAGCCGTATT 


CTGTCAATTG 


GAAGGCTTTT 


TCGTAAAGGT 


540 


AGTACATCAA 


GGTACTTGTT 


GAGTAGTTTG 


GACCACCAGA 


TGTCAAAAGC 


TGAATCAAGG 


600 


CGAAACACTG 


GAATGAGTTA 


ATTGTTGTGA 


TGATTGCAAT 


ATAAAGAGTT 


GTTGGAAGAA 


660 


GGCTTGGCCA 


TTTAATCTTC 


CAAAAAACTT 


GAAACTCAGT 


TGCACCATCA 


ACACG 


715 



(2)' INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 485 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 



CTGGGCTGAG 


TTTTCCAGAT 


ACAAATACTG 


CAACCGCAAC 


TGCAATCCCC 


CAACCCATAG 


60 


TAATCACAAT 


CCAACCTGCA 


CTGTTGCTCT 


TGGTTTTTGG 


AAGAACCACA 


CCTGCAACAA 


120 


CACCATTTCC 


TAGAAGAATC 


AGGATTAAAG 


TCCCTAAAAA 


TTCTCCAAAT 


AATTCACTCA 


180 


TCATTTTTCT 


GTCTCCATTA 


AAAAGAAGGG 


GCGGGCGACA 


AGGATTGCTA 


CCCTCCACCT 


240 


CTTTTATTTT 


TTCTTAATTT 


TTTAATTCTG 


CTAAGTCGTT 


TTGAGCAAGA 


GCTGCTTTTA 


300 


CATCAGCACG 


GTAAGTTGCT 


TTTTCTTCTT 


CTGTCCAGTC 


ATAGAATCGT 


CCCATTTCAT 


360 


CCAAAACTGG 


CTCAACGATA 


CTATCCAAGC 


TATCACGCAT 


AAAGAGCATG 


TGATTGGTAC 


420 


GACGAAGAAG 


GAAGTCAACT 


GGGCTAAGAG 


TCAACTCATT 


GCGCATTGCA 


TAGTGAAGGG 


480 



136 



WO 97/37026 
ACAAG 



PCT/US97/G5306 



485 



(2) INFORMATION FOR SEQ ID NO: 13 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 670 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:133: 

CTTAATTATA ACTTAAACAC AAAAGTTTTA CACAAACTGT GGATAACTCT TTTGAAACTG 
TGATTTTCTT AATTGAAATC TATTTTTTAT TTTGTGAATA AGATGTGAAA AAATAGAGAA 
TATGTTAGAA TAGAGTCATG AAAATTAAAG TTGTAACAGT TGGGAAACTG AAAGAAAAGT 
ATTTAAAAGA TGGTATCGCA GAGTATTCAA AACGAATTTC TAGATTTGCT AAGTTTGAAA 
TGATTGAGTT ATCAGATGAA AAAACACCAG ATAAGGCCAG TGAATCAGAA AATCAAAAGA 
TTTTAGAAAT AGAAGGTCAG AGAATTTTAT CAAAAATTGC TGACCGTGAT TTCGTTATTG 
TGTTAGCCAT TGAAGGGAAA ACTTTCTTCT CAGAAGAATT TAGTAAGCAG TTAGAAGAAA 
CTTCTATAAA AGGANTTTCT ACTCTTACTT TTATTATTGG GGGAAGTTTA GGATTGTCAT 
CATCTGTAAA AAATAGAGCC AATCTTTCTG TCAGTTTTGG TCGCCTAACC TTGCCTCATC 
AGTTAATGAG ACTAGTTCTT GTTGAACAAA TCTATCGCGC TTTTACGATT CAGCAGGGAT 
TCCCCTACCA TAAATAGAGA ATTGACTTTT AATTGAATTT TTGGTAGAAT AATTGTGTTA 
GGTCTCATAG 

(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 50.6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

CTATAATTTT ATTAGCTGTA TTCCAGTCGT CACGACCAAA CTCTGTTACA GGGACACGAA 
TGTCAAAACG GTTCTCAATC TCCACAATCA AC TCAACCGT TCCCATACTA TCCA^GAC^ 
CTGCATCAAA AAGATCTTCA TCCATCATGT CAGAAACATC TTCCATAAAC AAC^A^ 
TAATTTCGAT AACTTCTGAT TTGATATCCA TATTTTATTT CCTTTTATTT TTTAAACCAT 

*=a™ttc* MAATC caca aaaoattaao „ TCA c«c A ^caacato eJ^Z 
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ACCATGCCAA GCAACTGAAT CCAGCGATTC TCAGGTAGGG CAGCCTTCCC TGCTTTTTTC 
CGTTCCTTAT TGAGCGTTTT TTTCTTGCGA ACCCAGGCAT CATTGATGAA CAAGCCTAAG 
TCCATGAAAG AGTCCCATAG GCGATATAGT ACCAGGTCAC ACCATGCCAA AATCCCCATA 
ATCCAGCATA TTTACAATGT AAGATC 



480 



420 



360 



506 



(2) INFORMATION FOR SEQ ID NO: 13 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 921 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID. NO: 135: 



ATGGTTCGAC 


CCACAAATAT 


GTTTTGCGAA 


TCATTGACCA 


TTCTATGATT 


TACGTTGCCA 


60 


TTGCCGGCTC 


ATACACGCCC 


GTTGTCTTGA 


CCTTGATGAA 


TAACTGGTTT 


GGCTATCTGA 


120 


TTATTGTCAT 


C C AATGGGG A 


ACGACCATCT 


TTGGTATTCT 


CTATAAAATC 


TTTGCTAAAA 


180 


AGGTCAATGA 


GAAATTTAGC 


CTTGCTCTTT 


ACCTGATTAT 


GGGCTGGTTG 


GTTCTGGCTA 


240 


TCATTCCTGC 


CATTATCAGT 


CAAACNACAC 


CCGTTTTCTG 


GATTCTCATG 


GTAACTGGCG 


300 


GACTCTGTTA 


TACAGTTGGA 


GCTGATTGAT 


AGC CTTTATG 


CATTGAAATT 


TG AC AGTTTG 


360 


ACTTATGAAA 


ATAAAGCAGA 


GGTTATGGAC 


TTTATCAAGG 


CTCGTGTTGA 


TAAGATGATG 


420 


GGCTCTACTC 


CAAAAGATAT 


CAAGGGAAGC 


AGTTCTTGCA 


GGTTCAAACT 


TTGTTGTGGC 


480 


AGATATGTTT 


GGAAGCAGCA 


AGTGCTCTCG 


TAGAAGTAAG 


CAAGGAAGAA 


GATTTTAAAC 


540 


CATCTGTTGA 


ATCACTTTCT 


CGTGCCTTTA 


ACCTGGCTGA 


GAAGGCAGAA 


GGGGTTGC T A 


600 


CAGTTGATTC 


AGCACTATTT 


GAGAATGACC 


AAGAAAAAGC 


TTTGGCAGAA 


GCAGTAGAAA 


660 


CACTCGTTTT 


ATCAGGACCT 


GCAAGTCAGC 


AATTGAAACA 


ACTCTTTGCG 


CTTAGCCCAG 


720 


TCATTGATGC 


TTTCTTTGAA 


AATACTATGG 


TAATGGCTGA 


AGATCAGGCT 


GTCCGTCAAA 


780 


ATCGTTTGGC 


AATCTTGTCA 


CAACTAACCA 


AGAAAGCAGC 


TAAGTTTGCT 


TGTTTTAACC 


840 


AAATTAACAC 


TAAATAAAAT 


TTGATAAACG 


GACTTTATCT 


TATTACAAAG 


GAGAAGAAAT 


900 


GGATCCGAAN 


ANATTGCTCG 


T 








921 



(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



138 



WO 97/37026 PCX/US97/0S306 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

CTTGGCAACA GGCTCTTTTT GTATTGCGTT TATTAAGAAA AACAAAAAAC ATCGATATAA 
CATTTAACTA AAAATTATTG TATGTTATCT CCCTTAATTA GGAATGATAA GGGAATAAC- 
AGAAAGATTT GTGAATACAA ACTATTTCTG ATATACAAAA TATACAGTAA TAATGAATGA 
TGGGAGATGG GATGAAAGAA TTTCAATTTG AGAGAAAGCA GCGTTTTTCT TTGAGGCCAT 
ATGCAATAGG AGCTTGTTCG GTCTTGCTAG GAACGAGTTT ATTTTTTGCT GGTATGGGTG 
CTCAGCCTGT ACAGGATACA GAAACGAGTT CAGCACTAAT TTCAAGTCAT TATTTGGATG 
AGCAGGATTT ATCTGAAAAG CTGAAATCTG AGTTGCAATG GTTTGAATTA GAAAACAAGC 
TTTTGAACTT ATGGGAACAT TAGGTTACTA TGAAGGATTT GTTCCTTATG TTTCAAATCA 
ATACAAAAAC CAAGCTGAAG AAGAAGGCAA ACCGCTATCT GATAAATATA TTTTCGAAAA 
ATCTTAAGAA AAACATATGC ACCTTCAAAA A 

(2) INFORMATION FOR SEQ ID NO: 13 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 611 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

GTATCAACTC CAAAGAGCAA GTGACTGTCA AAGTTGTAAC AGATGCGGCC AAGAAGCTCA 
TGGGTGATAA GATTGCTCGC CAGAAAGAAC GTGGGATTCA GATTGANACC TTGCGNACCA 
TGATTATNGG GATTCCANAC GNTGGTAAAT CCACTCTGAT GAACCGTTTG GCTGGTAAAA 
AGATTGCTGT TGTTGGAAAC AAGCCAGGGG TCACAAAAGG TCAACAATGG CTTAAAACCA 
GGAAATACTT GGATACACCG GGGATTCTCT GGCCTATAGT TTGAGGATGA 
AACTGTTGCA CTTAAGTTGG CATTGACTGG AGC TATC AAG GATCAGTTGC TTCCTATGGA 
ACGC^T C ^AATTATTT CAAAGAACAT. TATCCAGAAA AGC^GC^ 

ACGCTTCAAA CAAATGAAAA TTGAAGAAGA AGCGCCTGTG ATTATTATGG ATATGACCCG 
CGCCCTCGGT TTCCGTGATG ACTATGACCG TTTTTACAGT CTCTTCCGTG aIgG^CC 

GAACTAAGTC ~™ «« 

611 

(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1054 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

139 
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(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 



TTGCTAAAGC 


AGCCATGATG 


ANGGCACTCC 


ATAAGGATAA 


GTACCACCAA 


GCAGCAGCAG 


60 


ANGACCATAA 


TCTCCTTTAT 


GACTTGAACG 


AGAACGTTCA 


ATAATAACTT 


TTTCTAGTAA 


120 


GGTTTGATTA 


ATCACTTTCA 


TCCTTTTTCC 


CTCTCACTTT 


TATTATACAA 


CAAAAAGGAG . 


180 


ACGCAGACCT 


CCTTTTGTAA 


TCTTATATCT 


AAAATTTAAT 


ATTCATTTCT 


GCCATTTTAG 


240 


ATATAGCTAT 


AGAAAATACA 


CTCTATTAAT 


CGAATGTTTC 


TCTTATTTTC 


TATCCAATGT 


300 


CCGAAGTGCT 


GCTTGATAAG 


TTTGCTCCAT 


CAGCATGGTA 


ATGGTCATAG 


GACCGACACC 


360 


TCCAGGGACT 


GGC GTGAT AT 


GGCTAGCAAG 


TGGTGCAACT 


GCCTCATAAT 


CAACATCTCC 


420 


ACAGAGCTTC 


CCATTTTCAT 


CTCGGTTCAT 


CCCAACGTCA 


ATGACAACCG 


CACCTGGTTT 


480 


GACAAAGTCA 


GCAGTCACAA 


ACTTGGCGCG 


GCCGATTGCG 


ACTACAAGAA 


TATCTGCTTT 


540 


AGCAGCCACC 


TTGGCAAGAT 


TATGAGTTCG 


TGAGTGGGCC 


AAGGTTACTG 


TCGCATTTTT 


600 


AGCCAAAAGA 


AGCTGAGCCA 


TAGGTTTTCC 


AACGATATTT 


GAACGACCGA 


TTACGACCGC 


660 


ATTTTTACCT 


TCCAAGTCAA 


TCCCATATTC 


ATGAAACATT 


TCCATAATTC 


CTGCAGGTGT 


720 


CGAGGGAATC 


ATGACTGGAT 


GTCCAGACCA 


AAGACGTCCC 


ATGTTTAGGG 


GATGGAAACC 


780 


ATCCACATCC 


TTTTCTGGGT 


CAATGGCTAA 


TAAAACCGCC 


TCTTCATCGA 


TATGTTTTGG 


840 


TAATGGCAAC 


TGGACCAAAA 


TCCCATGCCA 


AGCTGGATCC 


CTGATTATAT 


TTAGCAATCA 


900 


GGTCTAACAA 


TTCCTCTTGA 


GTTAATGGTC 


TCTGGAACTC 


GC ACT AC TTC 


GCTACGGGAA 


960 


CCAGCCGCAA 


GAGCTGACCT 


CTCCTTGTTG 


CGAACGTTAA 


ACTTGGCTGG 


CTGGATTATC 


L020 


CCCAACCAAA 


ATC ACT ACC A 


ACCAGGCACT 


AGAG 






1054 



(2)' INFORMATION FOR SEQ ID NO: 13 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 



CTGAACTGGA 


TTTTGCTACA 


AGGTCTGATT 


TGTCAAAGAG 


ATGCGAGTTA 


TGATATGAAG 


60 


CAGGATGATT 


TGGATAAGGT 


AGCAGATTAT 


CTCTTCAAAA 


CAGAAGAATG 


GACCATGTAT 


120 


GAGTTGATTC 


TTTTCGGTAA 


CCTCTATAGT 


TTCTACGATG 


TAGACTATGT 


CACTCGGATT 


180 


GGTAGAGAAG 


TTATGGAGAG 


GGAGGAATTT 


TACCAAGAGA 


TTAGTCGCCA 


TAAGAGATTA 


240 


GTGTTGATTT 


TGGCCCTCAA 


TTGTTACCAG 


CATTGTTTAG 


AGCATTCTTC 


TTTTTATAAT 


300 


GCCAACTATT 


TTGAGGCTTA 


TACAGAGAAG 


ATTATTGACA 


AAGGTATTAA 


GCTTTATGAG 


360 


CGTAATGTTT 


TCCATTATTT 


AAAAGGTTTT 


GCCTTATATC 


AAAAAGGACA 


GTGTAAAGAA 


420 


GGCTGTAAGC 


AGATGCAAGA 


GACCATGCAT 


ATTTTTGATG 


TGTTAGGTCT 


TCCAGAGCAA 


480 
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GTTAGCCTAT TATCAGGAAC ACTACGAAAA ATTTGTCAAA AGTTAATTTT CCCAAATAAG 
GGAAAAAATA AAAAGCTCCT TTCGGTTTTG ATACAATAGT TTCAAAATTT GAGAGGAG 



540 
598 



(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) . TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: Genomic DNA 



<xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


140: 






CTACTTCCCG 


GCCGAGTCGA 


TTCTCCGCCG 


ATATCGAGCA 


TACTGGCTCC 


TTCTGTTATC 


60 


AATTTACGAG 


CCTGCTGGAG 


CGCCTGCTCA 


AGAGCAAAAA 


ATTGACCACC 


GTCCGAAAAG 


120 


GAGTCTGGGG 


TTACATTGAT 


AATTCCGCAA 


ATAGCTGTCT 


TTGCATGAGT 


GGCTTTACTG 


180 


GACATATCGG 


TCACTCCCTC 


AAGGCTATTC 


ATCATATTAT 


TTCTCTATTT 


TACCATAAAA 


240 


AGAAAAAGAT 


GGACACGATT 


TCATTCATCT 


TTCTCCCAGT 


AGAAACAAGT 


AAGCAATTGT 


300 


CAATAATCTT 


AAACAGAAAT 


CCCTAATGTC 


CGACTCATAA 


TCACCACAAG 


AGCCAACAAA 


360 


CAGAAAGCAA 


TCCCATTAAC 


AATCATGTGA 


AGTAAGATCG 


ACATTTCCAA 


ACGTTGGGTC 


420 


TTGTAGGCTG 


TCCAAGATAG 


AACTGTCGAC 


ATACCTCCAT 


AAATCAATAA 


AGAAGGTAAA 


480 


TTACTTGGTT 


GATGTAATAA 


AGCAAACACA 


ATCGTACCGA 


CTACAAAT 




528 



(2) INFORMATION FOR SEQ ID NO: 141: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) 


SEQUENCE DESCRIPTION: 


SEQ ID NO: 


141: 






C T ATGAGAG A 


GAATGACCTT 


CTCTTGATTA 


CTGCGGACCA 


TGGAAATGAC 


CCAACGTATG 


60 


CAGGAACGGA 


TCACACTCGG 


GAATATATTC 


CATTGTTGGC 


CTATAGCCCT 


GCCTTTAAAG 


120 


GAAATGGTCT 


CATTCCAGTA 


GGACATTTTG 


CAGATATTTC 


AGCGACTGTT 


GCCGATAACT 


180 


TTGGTGTGGA 


AACTGC TATG 


ATTGGGGAAA 


GTTTCTTAGA 


TAAATTGGTA 


TAAGATGACG 


240 


CGCTATGCTT 


TGC TGGTGAG 


AGGTATCAAT 


GTTGGTGGTA 


AGAATAAGGT 


CGTCATGGCG 


300 


GAGCTTCGTC 


AAGAATTGAC 


AAACTTGGGA 


CTGGAAAAAG 


GTTGAGAGCT 


ACATCAATAG 


360 


TGGCAATATT 


TTCCTTTACT 


TCGATAGATT 


CCAAAGCCCC 


AATTGGTTGA 


AAAAGCTAGA 


420 
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GACTTTCTTT GCAGTCCATT ATCCATTTAT TCCAGAACTT TTTCCTTTAC TGAGTCTAGA 
GGACTTTGAA GGGCGGAACT TGAAA 



480 

505 



(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 



CTATTTTCTC 


ATCTTTTTTA 


GACTGGTAAT 


GTATCCCCTA 


GAGTTTCATT 


GTTAAGACCT 


60 


GTCGTATTGA 


CAACAGTGGA 


TCCTGATTC C 


CCATTCCAAG 


AGTTTTGCC C 


GCATCATATN 


120 


GACGAAATCT 


TTTTCTGAGC 


CAGCAATTTT 


CTCAGCTAGG 


GCAATAGCGG 


CGCTGTTGGC 


180 


AC T AGATACC 


AGAGTTGCTT 


CAAGCAACTC 


TTCGACAGTA 


TAATTACGGG 


CCTCCATAGG 


240 


AATATTACTG 


GCTTCAGAAT 


TTGTCGTCAA 


TTGATAAGGA 


TAATCAGAAA 


TATCTACAGG 


300 


AGTGGAGAGG 


GTAATACTTC 


CGTTTTCCAA 


AGCTTCATAG 


AC C AG ATAAA 


CAGTAATCAA 


360 


TTTTGTTATG 


GAAGCAATTT 


CGACAGGTGG 


CGTTGCATCC 


TTCTCATAGA 


GAATTTTACC 


420 


AGTATTTGCC 


TCAACAGCAA 


TCGCATGTTT 


AGCGGCAATG 


GTAAAATCTT 


GAGCAACAGC 


480 


AGTAGAAGCA 


CCCCCTAAAA 


GAGAGACAGT 


TAACAAAGTT 


AAAAATATTT 


TTTTCATAGT 


540 


AGTCTTATTC 


TATCATAAAG 


AAAAAAAATA 


TTCTTGCTTT 


AATAATTCAT 


CTGTTAAGCT 


600 


TTTTGAAAAT 


ATGGTAAAAT 


AAAGTGAGGG 


AGGTAACTCA 


TGTTTCGTAG 


AAATAAATTA 


660 


TTTTTTTGGA 


CCACAGAAAT 


TTTACTCTTA 


ACCATCATCT 


TTTAC CTATG 


GAGACAGATG 


720 


GGGTCTTTGA 


TTAACCCTTT 


TGTTAGCGTG 


CTTAATACAA 


TTATGATTCC 


ATTTTTATTA 


780 


GGGGGCTTTC 


TTTATTATTT 


GACAAACCCT 


ATTGTTACTT 


TCTTAAATAA 


AGTCTGTAAA 


840 


CTCAATCGTT 


TGCTTGGTAT 


TTTAATTACC 


TTGTGT 






876 



(2) INFORMATION FOR SEQ ID NO: 143: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
CTTGTCAAAT GCCATGGAAG GGGTTCTTTA CTTCCTCAAA CCAGACTTTT CAAAACTGAC 60 

142 
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TAGCGCTGAT CTCCTCTATG CTCTGGGACA ATCTTTCTTT GCCCTCTCAC TAGGGGTTAC 12 0 

AGACATGTTG ACCTATGCTT CTTACTTGGA CAAGAAAACC AATCTAGTCC AGTCAGGAAT 180 
CTCCATCGTA ACCATGAATA TCTCGATAGT CCATCATGGA AGGTCTAGCC ATTTTCCCAG 240 
CCATGTCAGC CTTCAATATC CACTCTGAAA GGGGACCCAG CCTGCTCTTT ATCGTCTTGC 300 
CTCAACTCTT TGACAAGATG CCTTTTGGAA CCATTTGCTA CGTCCTCTTC CTCTTTGCAA 3 60 

CTGTCACTTC TTCTGTCGGT GATGCTGGAG ATCAATGTGG GCAATGTCAC CAACCAGGAT 
AACAGCAAAC GTGCCAAATG GAGTGTTATT TTAGGAATTT TGACCTTTGT CTTTGGCATT 
CCTTCAGCCC TATCTTACGG TGTCATGGCG GATGTTCACA T 



(2) INFORMATION FOR SEQ ID NO: 14 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
Ui) SEQUENCE DESCRIPTION : SEQ ID NO: 145: 



143 



420 
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(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

CTGCAGACAA TAACTTGGTT AGCTTAACGG CTCTTGAGGA TTCATCCAAG GATGTAACCT 6 0 

TTACCAGTTC GGCTTTCAAT CTAAAAGAAG GGCGACACCT TCAAAAAGGG GATTCCAAGA 12 0 

AAATCCTTAT CCACGAAGAA TTGGCTAAGA AGAACGGTCT TTCACTTCAT GACAAGATTG 180 
GCTTGGATGC TGGTCAGTCT GAATCTGGAA AAGGACAAAC AGTAGAGTTT GAGATTATCG 240 
GCATCTTTCC TGGTAAAAAA CAAGAGAAAT TCACAGGCTT GTCTTCTGAC TTCAGTGAAA 
ATCAAGTCTT TACAGATTAT GAAAGTAGCC AAACCCTTTT GGGCAATAGT GAAGCTCAAG 
TCAGTGCAGC ACGCTTC TAT GTTAGAAAAT CCTAAGGAAA TCGACGGACT CATGAAGCAG 
GTAGAAAACT TGGCCTTGGA AAATCAAGGC TACCAAGTCG AAAAGGAAAA CAAGGCTTTT 
GAACAAATCA AAGACTC AG T TGCAACTTTC CAAACCTTCC TGACCATCTT CCTTTATGGA 
TGTTGATAGC AGA 



300 
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CTATGGATGC 


GGAAGTTTCT 


AAGAAC CTTC 


GCTTGATCCT 


TGAGCGTAAA 


GGAATGACCA 


60 


TCTTGACTGG 


TACTAAACTG 


CAAGAAATCA 


TTGAGGAAAA 


TGGTCAACTT 


CGTATCAAGG 


120 


TTGAAGGAAA 


AGACAATATC 


ATCGCAAGCA 


AAGCTCTTCT 


TTCAATTGGT 


CGTATGCCAG 


180 


ACCTTGAAGG 


TATTGGAGAG 


GTTGAGTTTG 


AATTGGATCG 


TGGTTGTATC 


AAGGTCAATG 


240 


AATACATGGA 


AACTTCAGTT 


CCACGCATTT 


ATGCCACCAG 


GTGACATCAA 


CGGTACTAAG 


300 


ATGTTGGCTC 


ACGCAGCTTT 


CCGCATGGGT 


GAAGTTTCCG 


CTGAAAATGC 


CCTTAAAGGA 


360 


AATCATGCAG 


TTGCCAAATT 


GAATTTGACT 


CCTGCAGCCA 


TCTACACTCT 


CCCTGAAGTA 


420 


GCAGCAGTAG 


GTTTGACAGA 


AGAACAAGCC 


CGTGAGAAAT 


ACGATGTTGC 


CATCGGTAAG 


480 


TTTAACTTTG 


CTGCTAACGG 


TCGTGCTATT 


GCATCTGACG 


CAG 




523 



(2) INFORMATION FOR SEQ ID NO: 14 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 6: 



CTGCAGGTGG 


AATTATGAAT 


GTCGGATATG 


AAAAAGCATT 


CTTGATGCAG 


ACATCGTTAA 


60 


ATTTGC C AAC 


TTCTGAAATT 


ATCTCGACAT 


ATGTCTATAA 


AGTTGGTCTT 


GTATCAGGAG 


120 


ACTATTCTTA 


CTCAACAGCG 


GTTGGTTTGT 


TTAATGCAGT 


GATTAACGTA 


GTATTGCTTG 


180 


TTGCAGTTAA 


CCAAATCGTT 


AAACGCATGA 


ATAATGGTGA 


AGGAATTTAA 


GGAGGAAAGT 


240 


ATGAAAAATT 


CGATTATGGA 


TACAAAATTT 


GATAGACGTA 


TCTTACTCTT 


AAATAAAATC 


300 


ATTATTGTCT 


TTATCGTTTT 


GATGACTTTG 


CTTCCTTTAC 


TTTATATCGT 


CGTAGCATCC 


360 


TTTATGGATC 


CTAAGGTTCT 


GGTTAGTAGA 


GGGATTAGCT 


TTAATCCAGC 


CGATTGGACT 


420 


GTAGAAGGTT 


ACCAGCGTGT 


TATTCAGTGA 


CCAATCTATT 


CTAAGAAGTT 


TTATCAATTC 


480 


CCTACTATAC 


TCTTTTGGAT 


TTGCAGCT 








508 



(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

144 
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CTAAAAAGAA CCAGATGGAT CTTGTGATGA ATTTACATCA TTTTGATTTA CCAGTGGAAC 60 

TTCTTCAAAA ATACGGTGGT TGGGAAAGCA AACATGTAGT GGAGTTATTC GTGAAGTTTG 120 

CCAAGACTGC TTTCAACATG CTTTGGAGAT AAGGTTCATT ACTGGACAAC TTTCAATGAG 180 

CCAATGGTCA TTCCAGAAGC AGGATACTTA TATGCTTTCC ATTATCCAAA TCTAAAAGGA 240 

AAGGGAAAAG AGGCCGTACA AGTCATCTAT AATCTAAACC TTGCTAGTGC AAAAGTGATT 300 

CAACTATATC GCTCATTAGG ACTTGATGGA AAGATTGGGA TTATTTTAAA CTTGACACCT 360 

GCTTATCCAA GAAGTAATTC TCCAGAAGAC TTAGAAGCAA GTCGATTTAC AGATGACTTC 420 

TTTAACAAAG TCTTCCTTGA ATCCAGCTGT TAAAGGAACT TTCCCAGAAA AGATTGGTAA 480 
AAACAGCTAG AGAGAGATGG CGTGTTATGG AGTCAT AC CG AAAAAGAG 



(2) INFORMATION FOR SEQ ID NO: 148 



528 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 8: 

ACTCTTATTG GTGAGGTATC GCAACTTCTG TTTCTGCCTT CCTTGGTGGA CCAGCCAATA 
CAACTTACGG AGAAAATACA GGGGTTATCG GTATGACTCG TATCGCTTCT GTCTCAGTTA 
TCCGTAACGC TGCCTTCATC GCGATTGCCC TCAGCTTCCT TGGTAAATTC ACTGCCTTGA 
TTTCAACTAT TCCAAACGCT GTACTTGGTG GTATGTCAAT CCTTCTCTAT GGGGTTATCG 
CCAGCAATGG TTTGAAAGTC TTGATTAAAG AACGTGTGGA TTTCGCTCAA ATGCGAAACC 
TCATCATCGC AAGTGCTATG TTGGTTCTTG GACTTGGGAG GAGCTATCCT TAAACTTGGT 
CCAGTTACAC TTTCAGGTAC TGCCCTTTCA GCCATGACAG GAATCATCTT GAACTTGATC 
TTGCCATACG AAAATAAAGA CTAAGAGTCT AAATACACCT AATCCACTCA GACAGCTGAG 
TGGATTTTTC GTATACCATA ATAAAAGTGT CTTAACAAAA TTATTAAAAT CAAAAAACGT 
ATAATATCAG AT ATTC T AAA ACCTTGATAC TGTACGTTTT ATCATAGAAA TTTTTACTTT 
ATTTTCTCAT CAAATGAGAT TTGCATCAAT CTCTTGTCTT ACTTGCGTTT CTTCTTCGCT 
TTCTTCATTT TGTTAG CC AT ACGTTTCATG GACTGTTTCA TGGCAAATTC ACCAATTTTA 
CCTTTCAAAC CGCCACCAAA CATCTGGCTC ATATCTGGCA TTCCTGCTCC TCCGAGAG 

(2) INFORMATION FOR SEQ ID NO: 149:. 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



145 
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(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 



GGGAGTTGGT 


TGCTATTNGT 


GAGGGAAAAG 


GTTTGAACTT 


TGAAATCGAA 


GTGGATGGTG 


60 


GGATTGATGA 


CCAAACTATT 


GCTCAAGCCA 


AAGAAGCTGG 


TGCGACTGTT 


TTTGTAGCAG 


120 


GTTCCTATGT 


CTTTAAGGGA 


GAAGTCAATG 


AGCGAGTACA 


AACTCTCAGA 


AAACAACTGG 


180 


ACTAGGGTTG 


CAGTTTTTGC 


AGGCGGAAAC 


CGCGGTCATT 


ATCGGACAGA 


TTTTGATGCT 


240 


TTTGTTGGGG 


TGGATCGAGG 


CTCGCTCTGG 


GTCTTGGAAG 


AAGACTTACC 


TCTTGCTCTA 


300 


GCAGTCGGAG 


ATTTTGATTC 


TGTGACGGAA 


GAAGAGCGAC 


AGGTGATTCA 


AAAACGTGCC 


360 


CAGTATTTTG 


TCCAAGCCCG 


GCCAGAAAAA 


GATGATACTG 


ATTTGGAATT 


GGCTCTCTTA 


420 


ACCATCTTTG 


AACAAAATCC 


TCAGGC TGAG 


GTCACTATTT 


TCGGTGCCTT 


GGGTGGC CGT 


480 


ATTGACCATA 


TGTTGGCCAA 


TGTTCTTTCT 


ACCTAGCAAT 


CCCTAAGTTG 


GCACCCTATA 


540 


TGCGTTCNAA 


TAGAAATTGA 


GGAATGGGCA 


AATTTGATTG 


CCTATTGTTC 


CAGAAGGGAA 


600 


CAGTCAGATT 


CGAATCCCTG 


CA 
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(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 497 base pairs 
<B) TYPE: nucleic acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 



CTATTATCAG 


CACTTTTATA 


CCTATTACCA 


AAAAAATCCT 


CAAAATGATT 


GGC AAAAGGC 


60 


TTTATCCACC 


AGCCATTTAT 


CAGCAATATT 


TACTTGAAAA 


ATATGGTAGA 


AATAGAAAGG 


120 


ATGGAGGAAT 


CTAATGGTAT 


TACAAAGAAA 


TGAAATAAAT 


GAAAAAGATA 


CATGGGATCT 


180 


ATCAACGATC 


TACCCAACTG 


ACCAGGCTTG 


GGAAGAAGCC 


TTAAAAGATT 


TAACAGAACA 


240 


ATTGGAGACA 


GTAGCCCAGT 


ATGAAGGCCA 


TCTCTTGGAT 


AGTGCGGATA 


ACCTACTAGT 


300 


AAATCACTGA 


ATTTTCTCTT 


GAAATGGAAC 


GCCAGATGGA 


GAAGCTTTAC 


GTTTATGCTC 


360 


ATATGAAGAA 


TGACCAGGAT 


ACACGTGTAA 


GCTAAGTATC 


AAGAGTACTA 


TGCCAAGGCC 


420 


ATGAC AC TCC 


TACAGCCAGT 


TAGACCAAGC 


CTTTTCATTC 


TATGATCCTG 


AATTTATGGA 


480 


TATTAGCGAA 


AAGCAGT 










497 



(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single. 
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(D) TOPOLOGY : linear 



PCT/US97/05306 



(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

CTGTTATGGA GTTGAAAGAA GAATACCACC CAGACGGTAA AGGTTTTGAA ATGATCTGTA 
TCGGTGGGAT GGGAGCTGAT TTCTTTAAGG CTCGCGGTAT TCAACCACTT TATGAATTAC 
GTGGCTTGTC AGACCAACCT AGCTTTGATC AAGTTCGTAA GATTATTTCA AAAACTGTTG 
AAATGTACCA . AAATGAACTC TTTGATGAGC TTTATGTTTG CTACAACCAC CATGTCAATA 
CGCTAACCAG TCNAATGCGT GTGGAACAAA TGCTTCCGAT TGTTGACTTG GATCCAAATG 
AAGCGGATGA AGANTACAGC TTGACTTTTG AAATTGGGAA ACCAGCCGAG AAGAAATTCT 
GGAGCAGTTG TTGCCTCAGT TTGCAGAAAG TATGATTTAC GGTGCCATTA TCGATGCCAA 
GACAGCTGAG AATGCTGCGG GTATGACAGC CATGCAAACA GCGACAGATA ATGCTAAGAA 
AGTCATCAAT GATTTGACAA TTCAGTATAA CCGTGCCCAG ACAGGCGGAT ATTACACAAG 
AAATTACCGA AATCGTAGCA GTGCCTAATG CCTTAGAATA GCTCTAGTCC CAGCTCTCCT 
CACTC 

(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

CTACACTTGA GGGAAGTTTG AAAATTCCAT TTTTC TTAAA GGGAATTGTG AAACGCTATG 
TATTTTCTTT TTACAACCGC ATGGAGCACT TGGTTGTGGT CAATCCTATG TTTATTGAGG 
ATTTGGTAGC AGCTGGTATT CCACGTGAAA AAGTGACCTA TATTCC TAAC TTTGTCAACA 
AGGGAAAAAT GGCATCCTCT ACCACAAGAA GAGGTAGTCA GACTGCGCAC AGATCTTGGT 
CTTAGTGACA ATCAGTTTAT CGTAGTAGGT GCTGGGCAAG TTCAGAAACG TAAAGGGATT 
GATGACTTTA TCCGTCTGGC TGAGGAATTG CCTCAGATTA CCTTTATCTG G^G^C 
^™ STCSTAT0AC ™» ^CACTATA AGAAAATTAT 

CCTAAAAATT TGAT^CC AGGCATTGTA TCGCCAGAGC GGATGCGCGA AATTGTATGC 
TCTAACGGAT CTTTTCTTGT TGCCTAATTA CAATGAG TTGTATGC 

(2) INFORMATION FOR SEQ ID NO:153 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 798 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION:. SEQ 10 NO: 153: 



TGTTCGCCTC 


CATATCCNAA 


ACTTGCCCTG 


CCACCATACC 


C AG AC TACCT 


GAAGCAAGGG 


60 


ATAAGTTGGC 


AATCGAGCCC 


ACCTNGATCT 


GACTTGGCGA 


ATCTGCCTGC 


GCAATCAAGG 


120 


CATATGGGTC 


TAAGAATAAG 


GATCTCCAGC 


CAAAATGGCC 


ATAGCTTCAC 


CGAATTTCTN 


180 


GTGATTGGTT 


AACCGCCCTC 


TTCGATAATC 


GTCATCATCC 


ATAGCAGGAA 


GGTCATCGTG 


240 


AATCAAGCTC 


CCTGTATGAA 


TCATCTC CAA 


GGCAGTAGCT 


ACCTGCGCGN 


GAGCAGGTTT 


300 


GATGGTAACC 


TGCAAGGCTT 


CCAGAACTTC 


TAACAAGAGA 


AAAGGCCGAA 


TACGCTTGCC 


360 


ACCAGCATGA 


ATAGAATAGA 


GAACAGACTC 


CCGTAAACTA 


GAGGCAAACT 


GCTGGTCTCC 


420 


ATAAAATCTT 


CCAAAGCCGA 


CTCGACAAGA 


GCTAATTTTT 


CTTGCTTTTT 


CATTCAAAAT 


480 


CACTTTCTGT 


TCCGTCTTCT 


TGCATGACCT 


TGACCAAGGT 


CTTTTCAGCC 


TTGTCCAGCG 


540 


TAGCTTGGAG 


C TCTTTTG AC 


AAGAC CATGC 


CCTTTTGAAA 


GGCAGTAATC 


GCATCTTCCA 


600 


GAGCAATTTC 


ACCATTTTCC 


AAACTTTGGA 


CAATGGTTTC 


CAGTTCTGCT 


AGATTTTCCT 


660 


CAAATTTCTT 


TTGTTTTGAC 


ATCTTTAACC 


TCTAATTCTA 


CTTGACCATC 


TCGCATCAAA 


720 


AGCGTTACTT 


GGTCTTTTTT 


CTTCAAACTC 


TCAACCGAAT 


CTACAACGGA 


TCTTCTTTTT 


780 


TGACAATAGC 


ATAACCAC 










798 



(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 547 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:154: 



CCATGTTTGC 


GGCAGTGACA 


GCAGGTTATC 


AGGCTGCCCT 


AATGGTACCA 


ACAGAAATCC 


60 


TCGCAGAGCA 


ACACTTTGAG 


AGTTTACAGA 


ACCTTTTTCC 


CAATTTGAAA 


CTGGCTCTCT 


120 


TGACAGGTTC 


CTTGAAAGCT 


GCAGAAAAGA 


GAG AAGTC TT 


GGAGACCATT 


GCCAAGGGTG 


180 


AGGCTGATTT 


GATTATAGGA 


ACTCACGCTC 


TGATACAAGA 


TGGGGTGGAG 


TATGCTCGTC 


240 


TTGGTTTGAT 


TATTATCGAT 


GAGCAGCACC 


GTTTTGGTGT 


AGGGCAAAGG 


CGTATTTTAC 


300 


GGGAAAAAGG 


CGACAATCCA 


GATGTCCTCA 


TGATGACGGC 


GACTCCCATT 


CCACGGACGC 


360 


TTGCCATCAC 


AGCC TTTGGA 


GATATGGATG 


TTTCCATTAT 


CGACCAGATG 


CCAGCAGGTC 


420 


GGAAGCCCTA 


TTGTGACGCG 


CTGGATCAAA 


CATGAGCAAC 


TACCTCAGGT 


CTTGACTTGG 


480 


TTAGAGGGGG 


AAATTCAAAA 


AGGTTCCCAA. 


GTCTATGTCA 


TCTCTCCTTT 


GATTGAAGAA 


540 
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(2) INFORMATION FOR SEQ ID NO: 155: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 53 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155 : 

CTCTTTTAAC CGTTTTAGCG GTGACACCGA GGTATTTTTT CAGGACCCAA GACTTGTCGG 60 

GCAACCGAAA CTGGGAGTTC GTCATCTCCA ATATGCAGAC CAGCAGCATC AACCGCAAGA 120 

CAAACATCCA ACCGATCATC GATTATCAAG GGGAC CTG AT AGGCATCTGT TATTTCCTTG 180 

AC TTGTTTTG CCAGTTGATA ATATTGATTG GTTGTGAGAT TTTTTTCTCG CAATTGGACT 24 0 

ATGGTAACCC CTGAACGGCA GGCCGTCTCA ACTTTTGCAA GAAAGCTTTC CACGGAATCT 300 

TGATAGCGAT TGGTTACCAG ATATAGTCTA AGCGCTTCTC TATTCATAAA CCTCTCCTTT 3 60 

GATGGTATCT AGCCAATTTT CATCTCTTCT TAGGAGCGAA AGO TG ATTG A GTACTTGGTA 420 

ACGAAATTCT TCCAATCCCA TTCCTTGAAC AACTATTTTC TCAGCCAGCG ATATTGAGAT 480 

AAGAGACTGC TAAGCAAGAA CTTCAAAACC AGTCTTTC CT TGGCTGAGAA AAACAG 53 6 

(2) INFORMATION FOR SEQ ID NO: 15 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 6: 



CTGGAGTGAC GATTGAGAAA ATCTTTGCCC TAGAAAATTA TCGAAATCAG TTAGCTGCTT 
TTCCGCAAAC TGTCTGGATT TCAGAGGATA TTTTGC TAG A TTTGGCGGAT TCTCAGACTC 
CACAGGGAAT TGTTGCCGTG GTTCAAAAAG AAGAAGTAGG ACAAGCTGAT TTGAGTCAGG 
GCAAGTTCTT GTTTTTGGAA GATGTGCAAG ATCCTGGTAA TGTAGGAACT ATCATTCGAA 
CTGCGGATGC AGCAGGTTTT ACAGGAGTGA TTGTTTCAGA TAAGTCGGCA GACATCTACA 
GTCTTCAAAA CCCTACGTTC CATGCAAGGT AGTCATTTCC ATCTGCCCAT TTACCGGATG 
ACTAGTCAAG CGCTTCTTGA CGAAACTAAA AAGGTAGCTA TCCCAGTGCT AGCAACAACC 
CAATCTAAAG ATTCTGTTGA TTACAGAGAA CTGCCTTCTA TAGAAAATTT TGTACTAGTT 
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ATGGGAAATG 


AGGGTCAAGG 


AATTAGTCCC 


CTTATGGCTG 


AAAGTGCAGA 


CCAGTTGGTC 


540 


CATATTAGCA 


TGAAGGGGCA 


GGCCGAGAGT 


TTGAATGTTG 


CGGTTGCAGC 


CGGTATTTTA 


600 


ATCTTCCATT 


TAAGCTAATT 


TTAACTTTTT 


TTGTTATAAT 


CAAGGAAAGA 


TGTTCACAGA 


660 


AAAGGAGAAA 


TGGATGAATC 


ACACTATTAT 


ACATGACCGT 


GCAGGTCTCA 


ATCAATTTTA 


720 


CGCTAAGGTT 


TATGCCTTTG 


TTGGTCTGGG 


AATCGGACTA 


TCTGCTTTGG 


TATCAGGCCT 


780 


TATGTTGACG 


GTCTTTCAGT 


CTCAGTTGGT 


TTACTTTTTG 


ATGCAGGGGC 


GTCTCTGGTT 


840 


GACCATTGCT 


ACTT 
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(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 



CTGAAGAAAT 


CCTAAAAGAT 


TTGTATTACG 


GAGCCGCTAA 


GAAAATTCAA 


GAAGAAGCCT 


60 


CTCGCTGGGC 


AGGAGTTGTA 


AGAAATGACT 


AAAACAGCCT 


TTTTATTTGC 


TGGTCAAGGT 


120 


GCCCAGTATC 


TACGGATGGG 


ACGGGATTTC 


TATGATCAGT 


ATCCGATTGT 


CAAAGAAACG 


180 


ATTGATCGAG 


CGAGTCAGGT 


GCTCGGTTAT 


GATTTACGTT 


ATCTCATCGA 


TACGGAAGAA 


240 


GACAAACTCA 


ATCAGAC CCG 


CTATACGCAA 


CCAGCCATTC 


TAGCGACTTC 


GGTTGCTATC 


300 


TACCGTTTAT 


TGCAAGAAAA 


GGGCTATCAC 


CCTGATATGG 


TTGCTGGTTT 


GTCTCTTGGA 


360 


GAAT AC TCTG 


CCTTGGTGGC 


AAGCGGCGCC 


TTGGATTTTG 


AAGATGCCGT 


TGCCTTGGTA 


420 


GCTAAGCGTG 


GAGCCTATAT 


GGAAGAAGCG 


GCTCCTGCTG 


ACTCTGGCAA 


GATGGTAGCA 


480 


GTTCTC C ATA 


CCCCGTAGAG 


TCCTTGAGAA 


GCCTGTCCAA 


AGCTCTGACT 


TGAGTGGTAC 


540 


TCCAGCCACT 


ATACACACTG 


CACAATCTCA 


TTGCTGGAAA 


ATTGTTGCAG 


TGATCAACGG 


600 


TGACTTTGCA 


GAGCAGTGCC 


AACCCTATCC 


CCTAGT 
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(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 



150 



WO 97/37026 PCT/US97/05306 

CTCTAAAACG AATTCTCTAC AAGCACCGCA AGGGCATGGC TGAACTTCCA CCATAAGGTG 6 0 

GTTTGTCTCG AAAGGCTAAT ACTTCNTTAA CCTTAGTTTG TCCTGAAAAT TGGTACATAT 12 0 

TGAAGAGGGC CGCCCGTTCT GCGCAGAGAT GGAAAACACC ACAGGTTCCC TCCATACAGA 180 
ATCCTGTAAA TATTTGTCCA TCTCCTGCTT CTACTGCAGC TACAACATGA TTGGCATAAA 240 
CAAAGTCTGA TACTTCATGT GGATTGTATA GTTTCTGTGC TTCTTCCTAC ATCTTTTCCC 
AGATGTCCAT TATTGTATCC TCTATATTTA GAGATTTCTT TTAGAATGTT TTCGATATGC 
TGAATTGATT TTTCACGTCC AAGCAAGAAA ATTGTATCTG GTAATTCTGG CCCATGCATT 
TCGCCTGAAA CTGCGATACG AATAGGCATG AAAAGATTTT TCCCTTTAAT ACCTGTTTCT 
TTTTGGACTG CTTTAATTTG CGGGAAGATA TTTTCTGTCA CAAATTCATC ATCTGTCATC 
CGCTTCAATT TTTGCTTTGA ATGCTTCAAG AACTGTTGGA ACTGTTTCAC CCGTCATGAC 
TTCGCGCTCT GCTTCTGTCA ATTCCTGGGA AATCTGAGAA GAAAGATCTG TCCATGGGAT 
ATCTCATCTA CTGATTCATT GTGGTTTATA GAG 



(2) INFORMATION FOR SEQ ID NO: 160; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 9: 

CTGACAAAGG AGACGGGTAT GG AACAAACA TTCTTTATCA TCAAACCAGA TGGTGTAAAA 6 0 

AGAGGGCTAG TGGGTGAAGT GTTAAAGCGC ATCGAACAAC GTGGATTTAC AATCGAAAAA 12 0 

TTGGAGTTTC GTTCACAGGT TTTCAGAAGA GTAGATTGAC CAGCACTATC AGGACCTGGT 180 
TGGTCAGAGT TTTTACCCAC CGATTCGTGA ATTCATGACT TCAGGTCCAG TTCTTGTGGG 
TGTCATTTCT GGTCCCAAAG TAATCGAAAC TTGGCGGACC ATGATGGGTG CAACTCGTCC 
AGAAGAAGCT TTACCAGGCA CTATTCGAGG TGATTTTGCA AAAGCTGCTG GAGAAAATGA 
GATTATCCAA AATGTTGTAC ATGGTTCAGA TTCCGAGAAG AATCAGCTAA GCCGAGAAAT 
TGCTCCTTTG GTTTTAAGAG TGGATTGGCT CAATCAATTG GATAAAAGCT CATTTGAATA 
GAAAGTATAG TCAATTAGTT TAAGACATGA CGCATGATAT CAAACTTTTT AGTTTTTGAT 
ATGGTGCGTT TTT 
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(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 160: 



CTAGAGATTT 


NCGAAGAGTA 


TTATTGTACT 


TTAAAGGTCT 


TGAGATAATT 


GTCTTTNCCT 


60 


ACTTGACCTT 


CGAAGGTTTT 


ACCATTTTCA 


AGGTAAGGAA 


GGTCATCGGA 


TACTGAAGCC 


120 


TTGACCTTGT 


ATATCTTGCC 


ATCAACTTTA 


AAGAAGTAGA 


CAGTGTCTCC 


CTTGATAACA 


180 


GCTGATTTGA 


GGTCTGCTAC 


TACTCCCTTG 


ATGCTTTCTG 


TCGTTGCATT 


GTCAATTTCA 


240 


AGGTCGTTTT 


TATTGGCATA 


CTTGCTGAGC 


ATCTCTTCCA 


CTGTAGTAGC 


AACGATAACA 


300 


TTTTGGTACT 


CGACTGCGTC 


TACCAGGGCG 


TACTCTTTGA 


CCAAGCCAGC 


ATTGTCCTTT 


360 


AAGCCCATGA 


TGTAAAGAGG 


CTTGTCATTG 


AGGTTGATAA 


AGATTGGGGA 


AAGGTTGCTT 


420 


TGTAGGATTT 


CTCCTGAACA 


GCACCTTCTG 


CTGATTCACG 


GGCTGATTCT 


TCTGTCGCAG 


480 


AAGCCAAG 












488 



(2) INFORMATION FOR SEQ ID NO: 161: 



{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 543 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 



TACAGCTATC 


GGGTC CGTCT 


GGTAGTTGGT 


CCTCAGAACC 


TTC AC TGCC A 


CTTCTTCCCC 


60 


ATCTAAGATT 


AAGTCTTTGG 


CTAGGTAGAC 


ATCCGCCATG 


CCTCCTCGAC 


CAATCTGTTT 


120 


GACAATCCGA 


TAGCGTCCGG 


CAAAAATCTT 


GCCGATTTGG 


ATCATTCTTC 


ATCCTCCTCG 


180 


TTCATAGAAA 


CAAGGGCAAC 


CGTAATGTTG 


TCTAAACCTC 


CTGCATTGTT 


AGCAAAACGA . 


240 


ACAAGTGTCT 


CCGTTTTATC 


TGC TAAAGG A 


ATATCACTGG 


TTACAATATC 


ACGAATCTCA 


300 


CTGCCTGAAA 


TCATGTTGGT 


CAAGCCGTCA 


CTATTGAGCA 


AGAGATAGTC 


ACCTGACTCA 


360 


AGGATAACTG 


TCCCAAAATC 


AGGCTGAATT 


TCATCTTTTT 


GCCCAATAGA 


CTGGGTGATA 


420 


ATATTTTTTT 


GTGGATGACT 


TCTGCCTCTT 


CTGGTGTCAA 


TTGACCAGCC 


TTGAGCAATT 


480 


CATTAACCAA 


AGAATGATCG 


CTCGTCAACT 


GATGGTATCT 


TCTCCACGAA 


TCAAGCCGAT 


540 


ACG 
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(2) INFORMATION FOR SEQ ID NO: 162: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

CTTTAAGGAA AATCAAATCT CTCATGCTGA TACCTCTCCT CATTAAATTA AATAGTAAAA 
AAGATTC TAT CTCACTCCCT GATTATTACA AAACCATTGA AATATCACAA CTAATAGGCT 
AGAATGGACA TAGTAAGATN TAGTAGATGA GTCATTCTAC TCAAATCCAC GTTAGAAAGG 
ACTGCTATGC CAGACAATCT CGCGCTTCGC ATGCGCCCTA AAACCATCGA CCAGGTCATC 
GGTCAGGAGC NTCTGGTCGG ACCTGGAAAA ATCATCCGCC GCATGGTGGA AGCCAACCGC 
CTGTCCTCCA TGATTCTATA TGGCCCTCCT GGAATCGGCA AAACCAGTAT TGCCTC^S 
ATCGCTGGAA CGACCAAGTA TGCCTTTCGA ACTTTCAATG CGACAGTTGA TAGTTAAAAA 
GCGACTGCAA GAAATCTCGG AAGAAGCTAA ATTTNCTGGT GGTCTCGTCC TATTGCTAGA 480 
CGANATTCAT CGACTAGATA GACC ^ 480 

504 

(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163 : 

CTAATAGAGG CGAAAAAATT TCAGGTCCTC CTTTGCTAGA TGATAATGAG GAAACTAAGA 
TTTTACCAAC CTCTTCTTCC CGTTTTGGTT ATGCCAATCC TAAGGATCAT GGTTTTAGCC 
AGGAAACCTT GAAGATTCAG GTCGAACCAT CTATTCATAA AAGCCGTCGT ATTGaZta 
CCAAGAGAAA TGTCTTCAAT TCTAAGTTGA ATAAAATCTT ATTTGCGGTC ATCTTTCTCT 
Tc^ZtZ TC ™ GCA ATGAAACTTT TGTAATAGAA AAGGAATTGA JIIZZ 
GGAATTATTG CTGCTATGCC AGAAGAACTG GCTTATCTGG TCCAGCATTT AGATAATGCC 
CAGGAGCAAG TTGTTTTGGG GAATACCTAT CATACAGGAA ACCATTGCTT CkI^ZI 
TCGTTCTTGT TAAAAAGTGG AATTGGTAAG GTCATGTCTG CTATGAATTG TGTGGCGAAT 
TTTGGCTGAT CATTTCCAGG TTGGATGCCC TTATTAATAC GGGTTCAG 

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 



CTAATAAGAC 


TGAAAAGAAG 


TCCAGCCAGA 


TGTCCCTAAA 


AATACAGAAA 


AAACATTAAA 


60 


ACCAAAGGAA 


ATCAAATTTA 


ATTCTTGGGA 


AGAATTGTTA 


AAATGGGAAC 


CAGGTGCTCG 


120 


TGAAGATGAT 


GCTATTAACC 


GCGGATCTGT 


TGTCCTCGCT 


TC AC GTCGG A 


CGGTCATTTA 


180 


GTCAATGAAA 


AAGCTAGCAA 


GGAAGCAAAA 


GTTCAAGCCT 


TATCAAACAC 


CAATTCTAAA 


240 


GCAAAAGACC 


ATGCTTCTGT 


TGGTGGAGAA 


GAGTTCAAGG 


CCTATGCTTT 


TGACTATTGG 


300 


CAATATCTAG 


ATTCAATGGT 


CTTCTGGGAA 


GGTCTCGTAC 


CAACTCCTGA 


CGTTATTGAT 


360 


GCAGGTCACG 


TAACGGGGTT 


CCTGTATACG 


GTACACTCTT 


CTTCAACTGG 


TCTAATAGTA 


420 


TTGCAGATCA 


AGAAAGATTT 


GCTGAAGCTT 


TGAAGCAAGA 


CGCAGATGGT 


AG 


472 



(2) INFORMATION FOR SEQ ID NO: 16 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 



CTTTTAAATA 


AGAAGAATCA 


CACTCATTGC 


TAAATGTGAT 


ATAATATTGT 


TCTCAGTCTC 


60 


AGCAGTGCGT 


TGTGAAAAAG 


CCATACTAAT 


CCCTGATTTT 


TCCGCATATT 


CCATCACCTC 


120 


AAGTGTTTTA 


CTCACTGAGC 


CAACTTGATT 


AGGCTTTATT 


ACAATACCAT 


CAGCACAATC 


180 


TTTAAAC TGT 


GAAATTCTTT 


CTAAGTTTGT 


TGCGTAAAAA 


TCATCACCAA 


AAACTTGTAA 


240 


CTTTAGAGGT 


TTTATCAATT 


GAAATTTTCT 


CCAACTATCC 


AAATCTTCAT 


CGGAAAATGG 


300 


ATCTTCAAGA 


TAAACCAATG 


GATACTTAAC 


TCCCCAATCA 


CAATAAGTAT 


CCATAATTTC 


360 


TGTGACTGTC 


TGTTGCTGAC 


TAACACACCA 


TGGTACTTTA 


TATATGCCAA 


GTGAAGAATC 


420 


GTAACGATCT 


GTCATTGCTA 


AATCTAAACC 


GATATCATAT 


CTATTTTTGT 


ATAATTTTAA 


480 


TGTC TCTAAT 


AACGAATCCA 


AAATAATATA 


AAAATCATCC 


GTGTTAATTA 


TTAATGCACC 


540 


TTGATAGAAA 


TAGATGTTGT 


AGAAACCTGG 


GTTTTAACTT 


TATCATAACT 


CATTTTTATG 


600 


TTAGA 












605 



(2) INFORMATION FOR SEQ ID NO: 16 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(XX) SEQUENCE DESCRIPTION: SEQ ID NO : 16 6: 

CTGAAGAAAA TCAGTAAACA AGAGCTGGTT AACACGCGCT TTTCTCGTTT ATTTGCTCAT 
TTTGGACAGG AAAAAGACGG TAGTTTTCTT GCCCAGCGTT ACCAATTTTA CCTCGCCCAr 
CAGGGACAAA CACTATCGGG CGCTCATGAT CTCTTGGACA GCCTCATTGA GCG TG ATT AT 
AAC TTGTATG CTGCGACAAA TGGCATTACT GCCATTCAGA CAGGACGTTT GGCTCAATCT 
GGTCTAGCAC CTTATTTCAA TCAAGTCTTT ATCTCAGAAC AGTTGCAAAC TCAAAAGCCG 
GATGCTCTTT TTTATGAAAA GATTGGCCAG CAAATTGCTG GATTTAGTAA AGAAAAGACC 
TGATGATTCC AGATTCTCTA ACCGCCGACA TTCAAGGTGG CAATAATGCG GGGA^ACA 
ZrTZZ: CACCTCGAAA ™ ACA — G ACTTACGA^G 



(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:167: 

ATCTGCCTTT ™TGCTATA CTTAAGGTAT GCATAGAAAA ACAGTGATTG 
ATTNTAGGAC TTTAGGGGAG AGATATACC T TTACCCANCC TATTAAAGAG TTGAAA^A 

'^r^ AGAAGTCGCA «™ct» cacnagtgga .aagctaccaa gIgcIaga" 

ATTATGTGGT GGGGTATGTC AGCTACGAGG CTGCACCTGC TTTTGAGGAG AAAT^GCaI 
TTCACAAGGT TCCTCTACTG GGCGAGTACT TGCTTTACTT TACTGTTCAC G^I^G 

~TG C Sc™ ™ AGG 

NTAACGTCTG CACAGAACTA TGANAAGGCC ATTGCCCAGA TACAC 
(2> INFORMATION FOR SEQ ID NO: 16 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE : Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 



CTCAAAAACA 


GCTTCTAACA 


GCTTGAACAA 


GGGGGCTAAA 


ATTGATTCCT 


TGATGTAGGG 


60 


TTTGAAGTAA 


GATAATAGOT 


GTTTCATAAA 


TCCCTTCTAT 


TCATATTCTA 


GAAATGAAGA 


120 


AAGTGGGAAG 


CCCCACTCTC 


TGTTTTATTT 


GTTTAAGTAA 


GGCAATAGAT 


AGCCATATCC 


180 


TGCTTTTTCC 


ATCTCATCCT 


TGGCCACAAA 


GCGTAAAGAA 


GCAGAATTGA 


TACAGTAACG 


240 


GAGGCCGCCT 


AACTCCCGCG 


GTCCATCTGT 


GAAAACATGA 


CCCAAGTGAG 


CACTGCCTGA 


300 


ACGAGAACGA 


ACTTCAATTC 


GCTCCATTCC 


ATGGCTCAGA 


TCCTTGTAAT 


AATGAATCAA 


360 


CTCTTTGGAA 


ATCGGACGGC 


TTAAAACTTG 


GCCAACCCAC 


AACCTGAAGC 


AAACTTATCC 


420 


TTGGCAAAAA 


AGAGTTGGCT 


CACCTGTTCG 


TAATATCTAC 


ATAAATCCCC 


TCCTTCCAAA 


480 


GGTTTTGGTC 


CATAGGCATT 


GGTAAATGGA 


CCTCCTGTTA 


CCAG 




524 



(2) INFORMATION FOR SEQ ID NO: 169: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE : Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 



CTTCATCTGG 


TGGTTCTCCA 


GTTGTAAGTA 


ACGTAGTACC 


ATTCTCAAAA 


CCATACTCAC 


60 


AAGTTGGTAT 


TAAGGGGGAA 


CCATATATCT 


TTAAAGGAAT 


GAAATTGCAA 


AAAGATATTG 


120 


TTACAACAAA 


AGAATATAAC 


GAGGTTTTAA 


AAAATGGCAA 


AAAGAAAAAT 


TGGAATCCAA 


180 


TAGCAAATAC 


CAAAAAGAAC 


TAGAAAAATA 


CATTAAATAA 


GGAATGGTAT 


TGATCTTGAT 


240 


AAAATTTTTA 


AAATACTGTC 


ATTTTGAATA 


TAAAGGAGTT 


TGATATGGAG 


TGGATTAGAT 


300 


TAATAGGAAT 


AGCAATCATT 


GTTGTGGGTT 


TTATTTTAAA 


ATTTGATACA 


ATTGCAACAG 


360 


TAGTCTTAGC 


TGGTTTGGTT 


ACAGCTTTAG 


TTTCAGGTGT 


TTCTCTCGTT 


GAATTTTTGG 


420 


AGATTTTGGG 


AAAAGAATTT 


AGCAATCAGC 


GAGTGCTCAC 


GATTTTTATG 


GTTACCTTGC 


480 


CTCTTGTGGG 


GCTGTCAGAA 


ACCTTTGGAC 


TCAACAACGA 


TCAATCGATT 


TGATTCGAAA 


540 


GATTAAAGGT 


CTGACAGTTG GAAACTTCTA 


TAC AGTTT AT 


TCTTTATCGA 


GAG 


593 



(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 456 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE : . Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

CCTGCTCCTT ATCTTTGCAG TAATTGGCGT TTTAGTGGCC TTGATAGCTC AATTTTACTC 
AGCAAAGGCA GCAGTAGGTT TTGCTAAGGA ATTGACAAAC GATCTTTATC GTCATATTCT 
TTCCTTGCCC AAGGACAGCA GAGACCGTCT GACAACTTCT AGTTTGGTTA CTCGCTTGAC 
TTCGGATACC TACCAGATTC AGACTGGTAT CAATCAATTC CTGCGTCTCT TTTTACGAGC 
GCCCATTATC GTTTTTGGTG CCATTTTTAT GGCTTATCGA ATCTCAGCTG AGTTGACTTT 
CTGGTTCTTA GTCATGGTTG CCATTTTGAC CATTGTCATT GTTAGGGTTA TCTCGATTGG 
TCAATCCTCT CTACAGTAGT CTCAGAAAGA AAACCGAACC ACTGGGTTCA NGAAACCCCC 
CNCCATTGCA AGGATGCCGG GTTATTCCGT GCCTTT 

(2) INFORMATION FOR SEQ ID NO: 171: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 65 base pairs 

(B) TYPE: nucleic acid 
CC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

CTTGTACAGT TGAATCGTTC AGATATTGAT TTTGAAAACA GAGAGTGTGT TGTCTTTGGT 
AAAGGAAAGA AGGAGAGACC AGTATATTTT GACGCTCGGA CGAAAATTCA TTTAAGAAAT 
TATCTTAACG ACAGAAAAAG ATAGTCACCC TGCTCTTTTT GTAACGCTAG TTGGAAAAGC 
CCAGAGACTT GGAATTGCTG GTGTAGAGAT TCGCTTAAGA AAGTTAGGAG ACAAACTCGG 
CATACAAAAG GTTCACCCAC ATAAGTTCAG AAGAACTTTA GCGACTAAGG CAATTGATAA 
AGGTATGCCT ATCGAACAAG TCCAAAAACT GCTAGGTCAT AGCAAGATTG ACACAACCCT 
GGCCTATGCC ATGGTCAATC AAAATAATGT CAAGCATTCA CACCAAAAAT TCATCTCTTA 
AAAGCAAATC CCGATATTCC GAAGAGGTGG TTGCATGAAG AAAGTGAAAT TAGGTGAAGT 
GGCTACTTTT ATCAATGGCT ATGCTTTTAA ACCTCAAGAT TGGTCCTCTG AAGGAAAGAG 
ATTATCCGAA TTACCGAATC TGACT 

(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 538 base pairs 

(B) TYPE: nucleic acid 
■(C) STRANDEDNESS: single 
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( D ) TO POLOGY : 1 inear 
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<ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 



CTTTATACTT 


TATCATTTCT 


AACTTAATTA 


TAGTCTTATT 


TTTAATAAAG 


TTCAATTATT 


60 


TATATGTAAA 


ATTCCTCATC 


AGTAAAAAAT 


AACCGATCTC 


ATTTCTGAGA 


ATCGGTTTTC 


120 


TAAATAAATC 


AAACCGATCA 


TTTACATAAC 


ATAAATTATG 


TAAATGATCA 


TACTACAACA 


180 


ACAAATCTTT 


GACTTTTC C A 


ATTTCACTTT 


TTGGAATAAC 


CAGGTGAATC 


ATATCACCCA 


240 


GATACATTCT 


GGTTGAGCCG 


TTAACTGTTT 


GGCTCTTGCC 


ATTATGGACT 


TGAGTTGTGA 


300 


TGAGGACGTT 


GTGTGGTAAG 


TTGAGTTCAT 


GAACTTGTTT 


CCCAGCAATT 


TTATCAGAAA 


360 


CTGGTATTTC 


GATAAGTGTA 


ACTTCTCCTT 


CGCTAGATAC 


TTCTTCTGGG 


AAGCATTTTT 


420 


TCCAGCATGG 


CTTCATAGAC 


TGGCGTTCCT 


TTGAGCNAAT 


CCCTGATNAT 


TTTAGAAACC 


480 


AGAGTGACAG 


ACCAGTGGCT 


TAAGGTGCGA 


ATATCTCCTA 


CCATCTCAGT 


TACGAGAT 


538 



(2) INFORMATION FOR SEQ ID NO:173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 509 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 



CGTGCTAACC 


ATGAGAGCAT 


ATATTTCCTC 


CAACTATTTC 


TATTTCAAAA 


AATCCTTACT 


60 


ATATTGTATC 


ACAATCAGAC 


ACAAAAAGAA 


AAAGCAAATG 


ATAAACAAAT 


GCTTTTAANG 


120 


TTTTAAAAAA 


AGCTTCGAAA 


GGTTCTTCTT 


TATTTTTTAA 


NGGGAGAGAT 


AACGTTGATA 


ieo 


TCTAAATCGT 


GGTCAAAGCC 


GGCAATTTTC 


CTTTAGATGT 


GTATTGGTGA 


ATATCATAAT 


240 


CTAAATCAGT 


TTTAGGACTG 


CTCTCCAAAA 


ATC CTGAGTC 


TGAGCCGTAG 


GACGGAATCC 


300 


AAACAGAGGT 


AAACTTGCCT 


GTATCAATAC 


TGTGTTCTTC 


CATGAAGTAG 


ACACCAACGT 


360 


AGATGCCGAT 


GTTTTTAGCA 


CCTAGTGATG 


CTAGTTTTGC 


TCGAAAGTTT 


TCGACACCTT 


420 


CGTTCATATA 


GACATAGTTT 


TGTCTCCACG 


TCAGCCCATA 


GTAACTAGGC 


TGTTAGGAGA 


480 


GCACATTGTA 


GAAAACTTCG 


GCAGCCTTT 








509 



(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 

CTGTTCTGTT TGCGACGCTA CTTGGTTGTA GCATGGGGGT GGTTTTAGAT GGTCAGTAAG 
TATCTTTTAT TAGCAGTTAT TTTCTCTGGC TTGGTGACTT GGATTCTCCG TATGATTTCC 
TTCATCTTAG TCAAGTATAA GGGCTTGCCT GCAATCGTTG AGCGTTTTTT GAAGTTCTTG 
CCCGTTTCCA TTATCTTTGC CTTGATTCTT TCAAGCGTAG TGACAGGTAA GGTTGGGAGC 
CTTCCTCAAA TTAAATGGCT AGACTTCTTA GCCGTCTTTC CAACAGCTTG GGTAGCCTTT 
CGCTACCGCA ATCTACTCGG AACAGTTCTC TTTGGAGTGG TCTTGATTGC CATCTTGCGT 
TTGGTCTCTT AAATTACCCA CCAAAAAAAC TTATCACAGA GATAGATATC ATATAATGGC 
GTAAATGCTC CTTTTCTGTT AAGATTATAA GGTATTCTAT TTTGGAGGAA ATGACATGAA 
AAAAATCGTT AAATACTCAT CTCTTGCTGC CCTAGGACTT GTTGCTGCAG TGTGCTTGCG 
GCTTGCTCAG GGGTGCTCAG AAAGAAGAGA AC 

(2) INFORMAT I OM FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

CTAGGTTATT TAGTAAAAGA CTAGAAGATT TTCTTTTC CC AATCGTCTTC TGTACGGCGA 
GGATAGAAAA ACTCTGACTT GTCGGGAGCT TCCATCATTT CCATCAATGG TAGCATATCA 
TAGGCCAGAT TTAAGTTTGG AATCTGGTCT TTTTGCACCC AAGAAACTTC TCCCTCTTCT 
GAAGATTGAA GGGTACCAGA GAACTCAGTC GCCTTATAAC AAATGACAAT ATAGCGCCCA 
CCTCTATCTA GTGGCCAATT TTTAATGCCG ACAAGTTGAG GATTTTGGAT AGTCAACCCT 
GTTTCTTCGT AGATTTCACG AATGACAGAC TCCGCAAAAG CCTCATCATT TTCTACATGA 
CCTCCAGGAA AGGCATAACC AGACCAGCGA TTGTTTTCAG GGGCGCGATA CTGCATCACC 
ACGCGCTGAG TTTCGANGTC TTCCATCAGA CAGATATTTG TTTAAATTGT TTAATTGGGA 
ACGGGACATA AATTTAC 

(2) INFORMATION FOR SEQ ID NO: 17 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 425 base pairs 
. (B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



PCT/US97/053O6 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 



CTGAAATTTT 


GGAAGAAGAA 


ATCGGTGTAG 


ATACAGTCCA 


AAAAATAGGA 


CGCATCTTGA 


60 


TTTTGTTTAA 


ACAATCTAGC 


AAGAAAGAAA 


ATCGCAAGAT 


TTCTAAGAAA 


GTCAAAGAAA 


120 


TCTAAGATCG 


AAACTCCAAA 


TAACTGTTTT 


TATAGAGAAA 


TAAAGGGGAC 


TAGCCTATGA 


180 


CAATCGAACT 


ATTGACTCCC 


TTTACCAAGG 


TAGAGTTGGA 


GCCAGAAATC 


AAGGAGAAAA 


240 


AACGCAAACA 


AGTTGGGATT 


TTAGGGGGGA 


ATTTTAACCC 


TGTTCACAAT 


GCCCATCTCA 


300 


TTGTTGCGGA 


TCAAGTACGG 


CAACAGTTGG 


GACTGGATCA 


AGTTCTGCTC 


ATGCCTGAAT 


360 


ACCAACCTCC 


TCACGTTTGA 


TAAAAAGGAA. 


ACCATCCCTG 


AACACCATCG 


TCTCAAGATG 


420 


CTTGA 












425 



(2) INFORMATION FOR SEQ ID NO:177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 



CTGCTATCAA 


GACAGTATTA 


C CGCTG AAAG 


AAACAGCTTC 


AC T AG AAGGC 


GCAATTGAAA 


60 


AATAAGACGG 


GATCACGAAC 


TCCCGTTTTT 


C T AT AAAAG A 


AAGGAAATGG 


GATGAAAAAA 


120 


TTAGTCTTTG 


TCTGTCTGGG 


AAATATTTGC 


CGTAGCCCTA 


TGGCCGAGTT 


TGTTATGAAA 


180 


TCAATGACAG 


ATAACTACGA 


AATCCAAAGT 


CGAGCAACTT 


CCTCTTGGGA ACATGGCAAT 


240 


CCGATTCATA 


AGGGGACTCA 


GGGAATTTTT 


CAAGAGTATG 


AGATTCCTTA 


TGACAAGAAC 


300 


AAGACATCGC 


TTCAGATTAG 


TAAGGAAGAT 


TTTGAAGCCT 


TTGATTATAT 


TATCGGAATG 


360 


GACGCTTCAA 


ATGTTCCGAC 


TTACGTCAGA 


TGTGTCCAGT 


AGACTGTCAA 


GATAAGAT 


418 



(2) INFORMATION FOR SEQ ID NO: 17 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 387 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/US97/05306 



(2) INFORMATION FOR SEQ ID NO: 180; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 

ACTGTTTGAA AAATATGATT CGTGGTCGAG AAATGAATTG CATTTAAGCA ATGTAGTTCA 
GTATATAGAT TTGGAAATTA ATGATTTAAC AAAATAAAGG AGAAAAAACA TGGTTAAATA 
CGG TGTTGTT GGAGCAGGGT ATTTTGGAGC TGAATTGGCT CGCTATATGC AAAAGAATGA 
TGGAGCAGAN ATTACTCTTC TCTATGATCC AGATAATGCA GAGGCGATTG CAGAAGAATT 
GGGAGCAAAA GTTAGCAAGT TCCTTAGATG AGTTGGTTTC TAGCGATGAA GTTAGATTGT 
GTTATCGTCG CAACTCCCAA ATAATCTTCA TAAGGAACCG GTTATTAAGG CTGCACAGCA 
TGGTAAAAAT GTTTTCCTGT TGAAAAA 

(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 179: 

CTTATGAAAT CAATCATCAA AATAATATTG ATCAGGACTA TTTAGGTAAA TTATCTACAA 60 
CGATTAAATT GGTAGCAGAA AAGGAAAATG CCGTTGAGAT CCTAGAACAC TTGAATGTTG 120 
TCCCTGTGTT GACAGCCCAT CCAACACAAG TGCAACGCAA AAGTATGTTG GATTTAACAA 
ATCATATTCA TAGTCTTTTG CGTAAATACC GTGATGTTAA GTTGGGGTTA ATCAATAAAG 
ATAAATGGCA CAATGATTTG CGTCGTTACA TCGAAATTAT CATGCAGACA GACATGATTC 
GTGAGAAAAA ATTAAAAGTG ACTAACGAAA TCACGAATGC TATGGAAATA TTACAACAGC 
TCCTTTTTGA AAGCTGTCCC TCATTTGACG ACGGAGTATA AGCGCTTAGC GCAAGCGCAT 
GGTCTGAATT TAAAACAGGC TAAACCAATC ACCATGGGTA. TGTGGATAGG TGGTGACCGT 
GAAGGAAATC CATTTGTTAC AGCAAAGAAC TTGAAGCAGT CTGCACTCAC TCAGTGTGAA 
GTCNTCCTGA ACTTACTATG A 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 



LunLL J. AA 1 L. 


A1A1 I VjH i. AV— 




1 U AL»t_ ACj A Ii 


IXjTTTGAGTA 


TAAATTACAA 


60 


TTAGCAGGAC 


AGACTTGGGG 


ATATTTAGAA 


TTTGAAACAA 


ATACAGAAAA 


ATATGGGAAA 


120 


GTATTGTTAA 


TTATAAAGGG 


TAAGAAGCGA 


CTTACGAACC 


AATTTCCTTT 


GGTACAAAAA 


180 


AATAAGAGTG 


GCTACTTATT 


TGAATATGCT 


CAGATGAATA 


CACTTTATCT 


TAATCAACAT 


240 


TCTTCCTACA 


AAAATGATGA 


AGATAGTCAT 


TCCTTTCCAA 


TTCAGATGGA 


GTTAGTTTCT 


300 


GATGAAATGA 


TTCCAAGAAA 


TTGAACAAGC 


TACTAAAAAT 


TCCGAATATC 


CGAAAAATTT 


360 


ATGATTTTAA 


CTTATGANGC 


GGACTCCCGA 


AAAACAATAT 


TATATCTGTT 


AGATGTTTGT 


420 


TATGCCCTGA 


TGCCCGAACT 


GGTCCAGTTA 


C AC TTGATTC 


CAGATTGTCC 


TGAGTATATC 


480 


CATCCAGTTC 


CGTACC 










496 



(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 



CCACCCTCTA 


GCAAGTGGTG 


CTAATTCCTA 


TAGC CTAGAA 


GAAAACGAAA 


TCATCTACCA 


60 


AAAGTTAAAA 


AACTTGCTTA 


AAAACTGATA 


AAATACTTGC 


CAAACTTTTC 


AGAATCTGAT 


120 


AGACTAGTAT 


GGTAACAATC 


TATGGCTCGC 


AAAGAGACCA 


TGGCAGAAAG 


GAAATATTGC 


180 


AAAATGAAAA 


AAGATATCCA 


TCCAGAATAT 


CGCCCAGTTG 


TCTTCATGGA 


CACAACTACT 


240 


GGTTACCAAT 


TCCTTAGCGG 


TTCAACAAAA 


CGCTCTAACG 


AAACAGTTGA 


GTTCGAAGGC 


300 


G AAACTT AC C 


CATTGATCCG 


TGTGGAAATT 


TCATCAGACT 


CACACCCATT 


CTACACTGGA 


360 


CGTCAAAAGT 


TCACTCAAGC 


AGATGGACGC 


GTGGATCGTT 


TCAACAAAAA 


ATACGGTCTC 


420 


AAATAATGAT 


AAGAGAACAG 


TTTCGGCTGT 


TCTTTTTTGT 


TTCTTGAAAT 


CAACTGCTGT 


480 


TTTCATGTTC 


CAGACTCATC 


TGTA 








504 



(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

CTCAGGCGAT TACTAATTAC ATGACTTCTG CCTCAAACTT TAATGTCGAT GAGGCTAGCC 
. AATTCATTCA ACAATTTACA ATTACAAAAC AAATCGAACA AGTAGAAAAA CTATTAGAGG 
AGTAGCATGG AAACTGCATT AATTAGTGTG ATTGTGCCAG TCTATAATGT GGC^AGTAC 
CTAGAAAAAT CGATAGCTTC CATTCAGAAG CAGACCTATC AAAATCTGGA AATTATTCTT 
°™»™> GTGCAACAGA TGAAAGTGGT CGCTTGTGTG ATTCAATCGC 
GACAGGGTCT CAGTGCTTCA TAAAAAGAAC GAAGGATTGT CGCAAGCACG AAA^L 
ATGAAGCCAG GCTCACGGGG ATTATCTGAT TTTTATTGAC TCAGATGATT M^C^C 
AGAAATGATT CAGAGCTTAT ATGAGCAATT AGTTCAAGAA GATGCCGATG 

(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 183: 

CTTATACTTG CTTTTTTCCT TTTGAAAAAT GTTAATAAAT CGTTGGTAGT CTTCTCTAGG 
CATGGACAGA TCAATATCGT CGTCCCAAGG GATAAAGCCC TCATGTCGAA CCGCCCCAA^ 

ATCCAAAATT TCTAATTCAA ST"" 

CTCCTA^ ™^ ATCrrCAATG CTTAGTTGTT TCATTTTAAA 

TTTTTGAATT TATTTTTTAA GGCTAGGACA TGGTTTAAAA ATTCATAf a a 
AATGCTATCT TTTGTGAAGA CAAGTAGACT AATATAAGAG ATAGCTGATA AC^GAC^ 

== = =r = == r ~ 

AAAGATATGT CCCAAGTGGA TGAGTTGTTT CTATGGATGA AATGAT ^ AGT AT AGC T 

(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 



ACCATATCAA 


TAC I A1 LbAA 


x ipa nnp 








60 


TGAATGACCT 


AGACGATGGT 


GAATTTTACT 


ATCACGAGAT 


TATCGGTTTG 


GAAGTCTA1 O 




AGGGTGATAG 


CTTGGTTGGA 


ACCATCAAGG 


AAAATCCTGC 


AACCAGGTGC 


TAATGATGTC 


180 


TGGGTGGTCA 


AACGAAAAGG 


CAAACGTGAT 


TTGCTTTTAC 


CTTATATCCC 


ACCAGTGGTT 


240 


CTCAATGTTG 


AT ATTC C AAA 


TAAACGGGTC 


GATGTGGAAA 


TCTTAGAAGG 


GTTAGACGAT 


300 


GAAGATTGAT 


ATTTTAACCC 


TCTTTCCAGA 


GATGTTTTCT 


CCACTGGAGC 


ACTCAATCGT 


360 


TGGAAAGGCT 


CGAGAAAAAG 


GGCTCTTTGG 


ATATCCCAGT 


TNTCATAATT 


TTTCGAANAA 


420 


AATGCTGAAA 


AGG 










433 



(2) INFORMATION FOR SEQ ID NO: 185: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185 : 



CTGCAACTTG 


AATTTCTCCT 


TCTTGTTTCA 


AAGCAAGATA 


AACAATTCGA 


GCCCCTCTTT 


60 


TTTCTAGCAA 


ATCCCCCATC 


TGGACAGATT 


GCATAAAGGA 


ACGAGAAGAA 


ACCTGATCAG 


120 


AATAAGTCTG 


AAACTCTTCT 


TTCGTGAGTG 


TTGTTAGTGC 


CATATACTTA 


CTTTCTATGT 


180 


TTTTTTCTTA 


ATGTTTTACG 


GAAATCAAGA 


GCAAGTCTTA 


ACAGAGGATA 


GAGAGGATGA 


240 


GTGGGCATTG 


TAAATTCACC 


CAAGTATTCT 


TCAATCGTTG 


GATTAAATTT 


TTCCTTAAAA 


300 


TGATAAAGTC 


CACCATTGAG 


AGAGTTTTC A 


ACACCACCTA 


AATTTTGCCA 


GACCATACCT 


360 


CGCTCAAAGG 


CATAGCGAGC 


CGTTTCATAC 


CATGTTAAAA 


TTGGTGCATT 


GTAACGTTTA 


420 


AAATCATCAT 


CCATACCAGC 


ATATATATTG 


ACAGAGGTAG 


TACC AAATTC 


CAAACTCAAA 


480 


GTACCGCTAA 


AGGAATCTCG 


CTTGACCTAC 


ATCTATATAT 


TCCTGCAAGA 


AGGTC AATTC 


540 



(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 
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60 
120 
180 
240 



CTCAGGCTAA AAGAGTCCAC TGGACTCTTT TACTCCGTCC CATAACCAAT GACTTAATGG 
CGCGTTCAGC TAGATTATTG GAAAGGACCA GATGTCCGTC TTTCAAAATA GTCTTAAAGG 
TTTCTTCATA CTTGAGGCTG TATTCAATTG CCCTTCCTAG TTTTGAACCC GATAAAACTG 
ACTGACGCCG GCACCAAGCA AAGAAATCTT CCATTAGGGG TTGGAGCTCT TCTTGACGTT 
TCTGTAGTCG TTCATCAGCT GACAATGTCT CCCAGTCTCT TTCCAAGGCA AATAAC^T 30 0 
CACAATAGGC TAATCCTTTA G "AlAACTGAT 300 

321 

(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 187 : 

ZTrrr^ ^TTGAT TTTGGAGGAT GAGAGCGCTG GAAGGGAACT 

CC^r C TTGACTCAAG .^CNATATAT CGCAACATTT GACCAACAAG CGCCA^AT 
CGATGAAATC TTTAAACTAT AATCAGGAGT GGAAGTATGA GAAATATGTG GGTTGTAATC 
AAGGAAACCT ATCTTCNACA TGTCGAGTCA TGGAGTTTCT TCTTTATGGT GATTTCGCCG 
TTCCTCTTTT TAGGAATCTC TGTAGGAATT CGGCATCTCC AAGGTTCTTC 
AATAATAAAG TGGC ANTAC T GACAACAGTG CCATCTGTAN CATAAGGACT GAAGA^TGtI 
AATGGTGTTA ACTTCGACTA TAAAGATGAA GCAAGTGCCA CCAGAAGCAA TTAaT 

(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

^ZT TA GAGTGGTTCT ^ATCTGCCA TGACCACAAA TNAGGACAAT GATAC^TCTG 
AACGTTCAGA CTGCTATCGT AAAAGGACAG CGGGTGAGAC GCCCATGAAT GATC^CCA 
GTCATACCCA CGGAGGAAAT TATACAATAG CACGTTATCA GGAGGAAAAA TOG^CA 
«C ACTTCTA AAATATAGTC TC CAC TTATC ^„ MT TTTTTACAGG ££££ 
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(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 9: 



CTTGTCATCC 


TCACGAAAAT 


TGCGATTGTA 


AAGCACCAAG 


CCTTGACTCG 


TGATAGACTG 


60 


AATCATGCTT 


CTCTCATGTA 


CTCCTCAAGT 


CGTTTCATGG 


CTTCTTTGAT 


AGTCTCCATG 


120 


CTGGCTGCAT 


AAGATAGGCG 


GACGTAGCCT 


TCCCCGTAAC 


GTCCAAAGGC 


TGCACCAGGG 


180 


ATAAAGGCAA 


CGGCCTTCTT 


CTGAGCAAAA 


TCCTTCAGAA 


AAGCAAAGGA 


GTCTTGATTG 


240 


TAGCCCGCTG 


GAATTTTAGC 


AAAAATATAG 


AAGGCACCGT 


CTGGTTTGAT 


AATCTCAAAA 


300 


CCAAGAGCAG 


TCATTTTTTC 


GATGATATAG 


TCCCGACGTT 


GGATATATTC 


CTTCTTCATG 


360 


GGCTCCGCAT 


CGTTTTTACC 


AGCCGTCAAG 


GCTTCTACCG 


CAGCATGTTG 


CGCCATGGTA 


420 


TTTGCGGCAG 


TGACCAAGTT 


ACTGGTGACT 


CTTGATTAAC 


TGGGCTGTGA 


AGGTCNCANG 


480 


AGCAAAAATC 


AGCCCCAAAC 


GCCAACCTGT 


CATGGCATG 






519 



(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:190: 



CTTCAGCTAT 


TCGTGGGATA 


TTAGATGTAC 


TTATTTAGGG 


GTTGAAATCA 


TATGAATATT 


60 


ACCAATTTGT 


TTTCTATCAA 


GACAGGATGT 


GATGAAACTG ATAGGCAACT 


GCAAAAACTA 


120 


TTTTTTCAGT 


TGGATTTACA 


ATTGGGAGAA 


TTGACAGATC 


AACTAAGAAA 


ATTAGATTCT 


180 


AATTTTGTTC 


CTCGTAGTCA 


ATTTGTAGAC 


ACGTTGGATT 


TGAATGATGT 


AGAATATAAA 


240 


GAAATTTTAA 


ACTATTTTAT 


CTTCCATCGT 


AATGATAGTG 


AAGAAAGTTT 


GGTAGAATGG 


300 


TTATATGATT 


GGATTTCCAC 


AAATCGTTAT 


GAACTTCCCT 


AAAGAGTTTT 


CCGATTCCGT 


360 


ATGGCTC CAT 


AAATACCCAT 


GAAAGTGTTA 


CTGAAGTTTT 


CCGGAGAATG 


AAT AAC T AAA 


420 


AAACAGTCAT 


TAGTGACTGT 


TTTTTATAGA 


AAAGAGTTTT 


TATATGTT 




468 
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(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

C TTG AATAGT CTCATCACGA AAATTGCCAT AACTGATAAA ACCAACCACC TTCACGCCAT 
CCATCGCAAT CAATGTATTT TCTGGATACT TTTGACTAAA GAATCGACAT CTTTCTAATG 
TCATTGTATC CTGAAATTCC GCAGGCAAAA GGTCATCATA AGACTCTCTC CACGTTTGCC 
AGTGAACGAG GGATTTGCCT TCTATCTCTT CAGGAGTTNC CATAGATTTG ATAATAACCG 
TCATTTATTT TCTCCCAGTC TNCTCTCAAA ATAC CAT ATT TAATACTATC AAAATATTTA 
CCTTGATAAT AACGAACTTT NGGAATATGA GCTNCTTTTT TCATNCTTAA TTTTTCAGCA 
AGTTTCATCA TACCAAGATT TCCTGACCAA GTTGTCAAAA CCAGATGCTC CAACTCCAAG 
TAATCCTGAA ACGTCCTATC TATCCACTGC AACATAGCAA CTTTCCCAAT ACCAGTGTTC 
CAGAATTTTT TATCATAAAT ACCAATTCCC AATCCATCCA TCTTGTTTCC TTTACATACC 
LAATA 

(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

CTTGATTGTG GAAGAAGTCC GTGATCGCTA TGTGGGCAAA GTCGATGCCG TT TTTC ATAA 
CGGCGATTCT GAACTACGTC CGGATTCTCC ACTTTGGGAG GGCATCCGCG 

aZZ GGAC TTCTACGCCG ^CCCAGA ACGTCTGGTG ACTGAGCTTG GTTCGACCAA 
GATTATCCAA ACTCATGGTC ACTTGTTTGA CATCAATTTC AACTTTCAAA AG^C^ 
CTGGGCTCAG GAGGAAGAGG CCGCTATCTG CCTCTATGGT CACTTGCATG TGCC^C 
TTGC^GAA GGCAAGATCC TCTTTCTAAA TCCAGGCTCT ATCAGTCAAC ££££ 
CATCAGAGAA TGTCTCTATG CTCGTGTGGA GATTGATGAT AGTTACTTCA AAGTGGACTT 
TTTGAGACGA GATCACGAAG TGTATCCAGG TTGTCCAAGG AGTTTAGCCG A^I^H 
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(2) INFORMATION FOR SEQ ID NO:193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:193: 



CGTAACCATT 


AACTATGAGG 


TGGTTTGCCT 


CCTCAGCGAC 


CGTATTCCGA 


GAGAATATTA 


6 0 


TTAGAAAAGA 


AAGGAGTGGA 


GCATGAATCT 


ACATCAACCC 


TTGCATGTCT 


TGCCTGGTGT 


120 


GGGACCAAAG 


TCAGCAGAAA 


AATACGCCAA 


ACTAGGAATT 


GAAAACTTGC 


AAGATCTCTT 


180 


GCTCTACTTT 


CCTTTCCGTT 


ATGAAGACTT 


CAAAACCAAG 


CAGGTGCTGG 


AACTAGAAGA 


240 


CGGTGAGAAG 


GCAGTTCTTT 


CTGGTCAGGT 


AGTGACTCCT 


GCTAGTGTCC 


AGTATTATGG 


300 


TTTCCAAGCC 


GCCAATCCGC 


CTGCGTTTTT 


AGTCCTCCAA 


GCCAGGGAAG 


AAGGTTCGTT 


360 


TTTTTTTGGC 


GGGTGAAATT 


TCCTTTTTAA 


CCCAGCCCCC 


TATCCTTGGG 


CTTGAATAAA 


420 


AAATTAGAAA 


TTTTGGGGAA 


CCAACCCCTT 


TGGCTTGT 






458 



(2) INFORMATION FOR SEQ ID NO: 19 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 



CCTTACTAGT 


TTACTCAATC 


TTTACATCGT 


TCTTTCGGCA 


GTTCAAACTC 


AGAATCCACA 


60 


GCACAGGTTA 


TGCAATTACT 


GTCTGAAAAT 


ATGTTAAAAA 


CCATTCAGTC 


ACTTTCGGTC 


120 


TGGCAGATTT 


ATTTGCTTGG 


TTTTGAGCGA 


ATCTTGGCGC 


TTGGTTTCCA 


ATTACTTTTG 


180 


ACAGTTTGGG 


TTTACCAAGC 


TGTTCGCCAG 


AAGAAATGGA 


TTTATCTCCT 


AGCAGCCTAT 


240 


GGCTTGCATG 


CCTTCTTTGA 


TCTGGCACCA 


TCTCTTTTCC 


AAGTAGGCTG 


GTTGACAAAT 


300 


CCAGTCTTGG 


TTGAAGTGAT 


TCTAGCACTG 


GAGCTCGTTC 


TGGTCGCCTA 


TGGAACCAAG 


360 


GAAATCTTTT 


GTAAAAAATC 


ATAAAAAGGG 


GGGAACCTCT 


TTTTCTTATG 


CAAAATCCAA 


420 


ACAAGGTATT 


TTTATGGTCG 


TCAAATGTCT 


CTAAAAATGG 


TATAATGGAA 


TGAATTTTGT 


480 


AAAAGGAAGA 


ATGAC ATGTC 


TGTAAGAGAA 


AAAATGCTTG 


AAATCTTAGA 


AGGAATTGAT 


540 
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(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



PCT/US97/OS306 

566 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

CTTTGACAAA GAGTATGACA CCTGACCGTG AAGTCATTAC CTTTATTCCT GAAAAATTTA 
TTGTGGATGG TTTCCAAGGG ATTCGTGACC CACGTGGCAT GATGGGGGTT CGCCTTGAAA 
TGCGTGGTTT GCTTTATACA GGACCTCGTA CTATCTTGCA CAATTTGCGT AAGACGGTTG 
AGCGTGCAGG TGTTCAGGTT GAAAATGTTA TCATTTCACC ACTAGCAATG GTTCAGTCTG 
TTTTGAACGA AGGGGAACGT GAATTTGGTG CTACAGTGAT TGATATGGGG GCAGGTCAAA 3 00 

CGACTGTCGC TACAATCCGT AATCAAGAAC TCCAGTTCAC ACATATTCTC CAAGAAGTGG 360 
AGATTATGTA ACTAAAGATA TCTCCAAGGT TTTGAAAACC TCTCGCAAAT TAGCGGAAGG 
CTTGAAACTG AATTACGGGG AACCTATCCG CCTCTTGCAA GCAAAAAACT CCATTAAAGT 
TATTGGAGAA TTAAACCAGT CAAAGTGACG GAACCTACTT GTC 



(2) INFORMATION FOR SEQ ID NO: 196: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 538 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196 



CTGTGATTTC AGAGAAGAAA TCAAGTGCTG TAACAGAAGT AAGATGTAAT TGTATGTAAA 
GGAGACGTCA TGTTAAATAG TATTGTAACC ATTATTTGTA TTGCCCTTAT CGCGTTTATC 
TTGTTTTGGT TTTTCAAAAA GCCTGAAAAA TCTGGACAAA AAGCCCAGCA AAAAAACGGA 
TACCAAGAGA TTCGAGTGGA AGTCATGGGA GGCTATACTC CTGAGTTGAT TGTCCTCAAG 
AAATCAGTGC CAGCCCGCAT TGTCTTTGAC CGCAAGGATC CTTCACCATG TCTGGATCAA 
ATTGTTTTTC CAGATTTTGG TGTACATGCG AACCTGCCAA TGGGGGAAGA GTATGTAGTG 
GAAATCACGC CTGAACAGGC TGGAGAGTTT GGCTTTGCTT GTGGTATGAA CATGATGCAC 
GGCAAGATGA TTGTAGAGTA GGTGGAGACT ATGACAGAAA TTGTGAAAGC AAGCTTAGAA 
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AATGGCATTC AAAAAATCCG TATCCGAGCT GAAAAAGGCT ATCATCCACC CATATCCA 53 8 



(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 515 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: . 



CCTCAAGAAG 


TCCGCGATAC 


TTACGAAAAT 


ATTTTGTAGA 


GAATCGAACC 


GCAAGGTTCG 


60 


GTTTTCTTTC 


TCTTTTTGTC 


TATAATTTGG 


TATAATAAAC 


AGTATGAAAA 


TCGTATCAGG 


120 


AATCTATGGG 


GGACGTCCCC 


TCAAGACACT 


AGAAGGCAAG 


ACAACAAGAC 


CTACTTCGGA 


180 


TAAGGTTAGG 


GGAGCCATTT 


TTAACATGAT 


TGGTCCCTAC 


TTTGAAGTGG 


GACNAGTCTT 


240 


GG AC CTTT AT 


GCAGGTAGTG 


GTGGTTTATC 


TATCGAAGCA 


GTATCGCGTG 


GCATGTCCAG 


300 


TGCTGTTTTG 


GTGGAGC GAG 


ACCGTAAGGC 


TCAGACCATC 


GTGGCTGAAA 


ATATCCAGAT 


360 


GACCAAGGAA 


GTTGGAAAAT 


TTCAACTCCT 


CCAAAATGGA 


TGCAGAAAGG 


GCATTGGAAC 


420 


AGGTTNTCTG 


GGGAATTTGA 


CCTCGTTTTC 


TTAGACCCTC 


CCTATGCCAA 


GG AACAAATC 


480 


GTAGCAGATA 


TTGAAAAAAT 


GGCTGAGAGA 


GAGCT 






515 



(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 



CTGGANACAA 


ACTTT AC AGA 


TCAAGTCGAT 


ACGATGATTT 


ATGTTGATAA 


AGAAGAAAAA 


60 


GAAACTATTA 


AAGCTGCACT 


TGTGGAGTTT 


TTTAATGGAA 


AAGTCACTTT 


AACTGACCAA 


120 


GGTTTACGAG 


AGGTTGAAGT 


TCCTGTAAAC 


TTAGTGTAAA 


CAATGAATAA 


TACAGCGTTT 


180 


CGTTGACATT 


CTCACAACTA 


CTTTAGCGAG 


CAAAATAAAA 


AGATGCGTAC 


CAAAATATAC 


240 


TAGAAAATGA 


AGCAATTCAA 


ACGAAACCTG 


ATATCGTTTT 


CCTTCACACC 


TATTTACTAG 


300 


AATTAACTGA 


ACGCAATCAC 


TTGAAAATTA 


ATGACTTTGG 


AT 




342 



(2) INFORMATION FOR SEQ ID NO: 199: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 418 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 



CTATTAGCGA CTTTCTCTGA AATATGGTAT GATAAAGGAT ATACAAGGAG ATAAAATGAA 
TAATAATTTA CTGGTATTAC AATCCAGACT TTGGTCTGGT TGATGGTGCG GT ATCGGCTA 
TCATTGGAGT GGCTTTAGAA GAGTCTCCAA CCTTAAAAAT CCATCACTTG AC^A^A 
TCACGCCTTA TAATATTTTT GAGGGGAGCT ATCGTCTCTT TCAGACGGTG GATTACTGGC 
TA"™ So™ TCGGTTGTCG ATCCAGGTGT CGGTTCGAAA CGTAAGAGTG 
TAGTTGCCAA GACTGCTCAA AAATCAATAC ATTGTCACGC CAGATAATGG GACGCTTTCC 
TTTATCAAGA AACACGTTGG CNTTGT ANC C ATTCGTGAGA TTTCTGAGGT GGCCAACT 

(2) INFORMATION FOR SEQ ID NO: 2 00: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 463 base pairs' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 00: 

CTCTCGCTTT TCTCATAGTG GOAGGTAAGG ATGGAATTAC GCAGACCAAG ATTAGCGGAT 
T ^" T T^^C. CCC«^C cZZ^l 

GGATTTTGGG ATACAGAGAA TTTTGTGTAT GAAGAGTGGT TGGAAAGCAA TCAGGAACAP 

ZZZZZ ™* raoee TCAASGATO3 gt ™« ™™ 

A ^TTAAT CTCCGGTTGC GCCTCAGTAA CTTTC^I 
OAAGAAOGTG GGCACATTGG CTACTCCATC CGTCCATCTG AAAGAGGCAA GGGTTATGCA 
AAAGAACTCT CCGTCAGGGC TTGCAAGTTG CTAAGGAAAA GAACATCAAG AAAGCTCTGG 
TGACCTGTAG CGTGAATAAT CCTGCTAGCA GAGCATCATT CTA ^^TGG 

(2) INFORMATION FOR SEQ ID NO: 2 01: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 475 base pairs 
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DNA 

SEQ ID NO: 201 : 

ATCCGAGAAG CCATTCTATC TGGCTTACCA 60 

CTGGCTAAGG AAATCACTTC TCACAAAGAG 120 

GAGCGTAATG CTTATGGGCG CCAATTANGA 180 

TTGGCNAGAT TCCNATGACC TTTATCCGTG 240 

TANAAATTCT AGCAACAGTG AACNATCAAT 3 00 

TAAGTCCTTT TCATCCAGAA TTGACTGATN 3 60 

TGTGTAAAGA AAAAAGTNGA GATTGAATTT 4 20 

NCNATGTATT GGAGTACGGA CGCAG 4 75 

ID NO: 202: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

( i i ) MOLECULE TYPE : Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 02: 



CCTTCAAAAA 


TTATTTGATG 


CAACCTTAGA 


CAATCCAGAA 


CTCACCGCAA 


CATTAGTGCC 


60 


TTTAGGAGAT 


GGTATTC TC A 


TGCTTCGTAA 


AAATGTAGCA 


GATGTTCAAC 


TGTCTGAAAG 


120 


CGAATGATTT 


TCAGAAAAAT 


TTAAGAAAAA 


ATAGTAAAAT 


AGATAGAGTA 


ACACTTATCT 


180 


CAAAGGAGTA 


GACATGAAGA 


AAAAATTATT 


GGCAGGTGCC 


ATCACACTAT 


TATCAGTAGC 


240 


AACTTTAGCA 


GCTTGTTCGA 


AAGGGTCAGA 


AGGAGCAGAC 


CTTATCAGCA 


TGAAAGGGGA 


300 


TGTCATCACA 


GAACATCAAT 


TTTATGAGCA 


AGTGAAAAAC 


AACCCTTCAG 


CCCAGCNAGT 


360 


CTTGTTAAAT 


ATGACCATCC 


AAAAAGTTTT 


TGAAAAAACA 


ATATGGCTCA 


GAGCTTGATG 


420 


ATAAAGAGGT 


TGATGATACT 


ATTGCCGAAG 


A 






451 



{2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 53 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: single 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic 

(xi ) SEQUENCE DESCRIPTION: 

CTCTTACGTG ACCAGAACAT GCTACTTCCC 

GTGTTTGGGA CCTGTGCGGG CTTAATTTTG 

AGTTATCTAG GAACTATGGA TATGGTGGTC 

AGTTCTNCAC GGAAGCNNAA TGTAAGGGAG 

GTCCGATTAT CAGTACTGTT GGTGAGGGTG 

TTGTTGCNNC CCAAGANNAA AATATGTTGG 

ATNTGCGCTT GCNCCAGTAC TTTATCAGTA 

CTCCAACTTT NCCACNTGTA ATNNNCAATN 

(2) INFORMATION FOR SEQ 



WO 97/37026 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: Genomic DMA 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 203 



CTATGGATAA GGCACTTGCT GACCTCAAAA CATCAGGGCA CTTGCCTATT CCGCrA, 
TACGTGATGG GCACTACANT GGAAGCAAGG AACTGGGGAA TGCCCA^gIc 
CACACAACTA TCCTGNAAAT TGGGTCAAGC AAGACTATCT SaC^ 
ATCACTATTT CCAAGCAGAA TATACTGGTA AATATGAACG S^T^ ATTCGTAATC 
AAGCTATCGA CCATTTGCGA AAAATCTGAA ATCCTTTTCA AAAAATTGCA C^^ 
GATTTTTTTT GAAAAAGTGG TATC ata t> a a *~ AAAAATTGCA CTTTCCTCTT 

ACTTAAGTGT LACCCAC^ ££££ £~ 

TGTAGGCCGT CTCACACGGA AACAGCTTCA GTT ACAAAAGTCT TCTGTCAGCA 



(2) INFORMATION FOR SEQ ID NO: 2 04: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECUXE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:204: 

TrZ^ — 

AAGAAGCAAT CCAGAAAGGA TTGAAAGAAT TAGATATTCC 
AAGTCATTTC TAGGGAGAAA AAAGGCTTTC TTCG^TCTATT TGGTAAAAA A 
TGGATATTGA AGCGATTAGT GAAACGACTG TTGTCAAAGC ^ZT^ ^AGCCCAAG 
GCGTTCCGAA AAAAATCAAT GATTTG A A rv» _ AAATCAACAG GTAGTAAAAG 

GTTGAC CTTG GTCATGTGGT TCATGCTATT ^^AAGAAACC 
TCTGATGAAG TC TGCTATT AAAAAAATAG AGGAAGAAGG TCAAGGTATT 



(2) INFORMATION FOR SEQ ID NO: 2 05 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 428 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:205: 

AAAATCAAGA AATTCTTTGA AGGAACGACA GATACAGCTG AGAACTATAT CAAGTCGGCC 
CTTAAAATGT TGGTCAAATA GGAGCAGAGA ATGACAAAAC GTTGTTCGTG GGTCAAGATG 
ACCAACCCGC TCTACATCGC CTATCATGAT GAGGAGTGGG GTCAGCCCCT CCATGATGAC 
CAAGCATTGT TTGAGTTGTT GTGTATGGAA ACCTATCAGG CAGGCCTGTC TTGGGAAACG 
GTACTCAACA AACGCCAAGC TTTCCGAGAA GCCTTTCCAT AGCTATCAAA TTCACTCCAG 
TCGCAGAGAT GACTGACACT GAATTGGAAG CCATGCCTGG AGAATCCCAG CTATCATTCC 
GAAATAGAGC CCAAGATTTT TGCTACACGC CGCCTAACGC CCCAACCCTT TCCTACAGTT 
ACAGGCAG 

(2) INFORMATION FOR SEQ ID NO: 2 06: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 



(2) INFORMATION FOR SEQ ID NO:207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



60 
120 
180 
240 
300 
360 
420 
428 



CCCAGACACT 


TACGGATCTA 


GC TG AG AC AG 


GGAATTGTCC 


60 


TAACTTGACC 


TTGGTGGTTA 


CCAAGTAAAT 


CTACAAAGGT 


120 


CGACAAACAT 


TGACTTGCTG 


TTTTCTTGGT 


CATTTGAAAT 


180 


GATTTTCAGC 


ACCTGAACGT 


ACCCAACCGA 


TACAGTTAGC 


240 


GTTCTCCATA 


GGCCAAATCT 


TTTCGGATGG 


CTAGGAGGCG 


300 


CTTGTTGAGC 


ATACTGGCCT 


GAAATCCCAT 


AGTAGTCTCC 


360 


CGTCTTGGCG 


TAACAAAATG 


AGGG C AT AGG 


CTGCTGGCTT 


420 


ACTCAAGGGC 


TTGTCCTCGT 


TGGGTATCGT 


GGTTGTCGAC 


480 


CTTGAGTTCA ACCAGGCT 






518 



(ii) MOLECULE TYPE: Genomic DNA 

174 



WO 97/37026 



POYUS97/0534J6 



60 
120 
180 
240 
300 
360 
420 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

CTCTGTCCTT TTACCTGAGA GTTTGAGCAG TTGCCTGCCT TGCCCCTTCG GTGCCTTTAC 
GGTCTCTCCA GAGTTCCGTC CATTTACAGT CATGGAAAAT CAAACGATTC CCCACTTCTA 
TTAAACTTCA TTCGGTGTTG GTATTTAATT GATTCTAATT TTACAAAAAA TGTTGGCTTT 
TGTCAATGTG TTTATTAGTA AAAATTAGTT CAACAGTTTT TACTTTATAA AGTCCA^ 
ACTGCTATCC TTTAAAAGTG ACAATAGTCG CACCACTGCC TCCAGCATTT TGTGGGGCAT 
AGCCGAAACT CTTGACATGT TTGTTTCTTT GCAAGTATTT GGTAACTCCT TcI^I 

clZlr™ GATACCATCG ATGATATC - OTTGAGCCAT ATTGTTAAGC AAGGCTTGGT 
CGAATGAAGG TATCTAGCTC ATTCATGGCT TCTTCATAGC GCTTGCCTCG AAGATTCAGT lan 
CTAGCTTGAG TCCTCGCCCA GAAGTTCG AAGATTCAGT 4 80 

508 

(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 
CTCTTGGGCT CTCTTCTCAG CAACTTCCCT GCTTTCACCT TTGGTCTTGA TAGGTGANAA 

a"™ GCATGGGAAA ™^ ™^ 

A ™ GCATGGGAAA CAGGTCATTC TTCTTGTCCG TAATATCGAC TCCCTCAAAG 

C^TCC^C ^™ T C ™ CTCCAAC ™™ ™ AGCGAAGCAA GGTAGATTTC 
CCACTCCCTG AAGGACCGAT GATAACGACA ACTTCTCCTC TTTTAATCTC GAGGTTGATG 
CCCTTCAATA CTTCATTCTT TCCAAAAGAT TTATGTAATT TTCAATTTTT ATCAAGGTTT 3 6 

CTGTCATTAT TTCTTATC aiuaacgttt 360 

378 

(2) INFORMATION FOR SEQ ID NO: 2 09: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 741 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:209: 

175 
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CTGGCCAGCC AGATAAACCT CGATACGGCT TTCTGCTCGT TTGATGCGTT GCAAGAGTCC 
GCCCATACGG ACGTCAACTG TATCCAAACC AAAGACCTTG TTTTATTTCA GCCAGTGGTG 
ACTAAAGAGG GCATGGAAGT CTTCAATTTG GCTTCTAAGT TCTGGTAATT CTTGTCTGGC 
GATTTGTTGT AAACTTTCTT TATCATCCGC TTGGTAGGCA TGACGAATGC GTCGTCCCAC 
ATCTACTTTG CTACTTAAAA TAGCATTCAA CTGGGCCTGA GTTTCAAAGA GATAAGCATA 
GTTTCCAGCT TTTTCTTTAA TGTCAGCAAT AGTTCCCGCC GCCTGAGCGA AGTGTGGCTT 
GTCCTGTTCA GGTGTCATGT GTCGGTCAAG TATCGGACAG AGAACATCCT GATAAAAGAC 
ATAGCGGTTG GGATTGATGC CACTGAGATT ACCTGGTAGG TCTGGTAAGA GGTTGGCAAG 
ATCAATCTGC ATAAAATCCT CAACCGTTAG ACCAGTATTG GTCTTGAAAT GCGCAGACAA 
ACTATCTAGG TCATTGCGAT AGATCAAGTT ATCGATACCG TCGACCTCGA GGGGGGCCGT 
ACCCAGATTT GCAAGCTTGG TAGGATAAAG AACTGGGCAG TTTCACCACC ATIGTCTCCC 
CAACCCGTTA CGATGACTTC TTTAATCTGA TTGGCACGGC AGGCTTTATT AGCCTCGATA 
GCCACTAGAC GGCTAAAATG G 

(2) INFORMATION FOR SEQ ID NO: 2 10: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LEKGTH: 1349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

AAACTCGATT TGAGAGAAAG TCCAATAAGT CTTTCCATAA TAAAACGCAT AGTAGCAAGA 
GTTTCTACAC CCACTATTAT GCGTTTTTAT CATTTTGAAA TCTACTGCTA ATCAATGAAG 

aIgc^aagat caaactaggc agctagttac aagttacgta agcgcgtcat AACAAGGTAT 

C^ATCCTAAT TCCCCAACTT CCAGTATACT AGATAAAAAA TAATTATCGG AGGAAAGGTA 

SgSctatg ataattcaac taagtgattt aggtcaagtt caccttgttt gtggcaagac 

AGATATGAGG CAGGGAATAG ACTCATTAGC CTATGTAGTT AAAACCCACT TTGAATTGGA 
TCCTTTCTCC GGTCAAGCCT TTCTCTTTTG TGGTGGACGT AAAGACCGCT TTAAAGCCCT 
TTACTGGGAT GGTCAAGGAT TTTGGCTACT ATATAAACGC TTTGAGAACG GAAAACTGAC 
SgaccIagt ACAGAAAAGG ATGTCAAAGC TCTCACACCA gaacaagtag actggcttat 
GAAfaGGCTTT TCTATCACTC CCCAAATATA GTGGATTTAA ACTAGAATAG TACACCCCTA 
c^taLac attgttagaa atcgatttga CTGTCCTGAT caatttgtcc TGTTCTTATT 
tcIttttact atagaatcca tctgaaagcc gtttatgatt tctattgaaa tgaagaccgt 
ccatt^agt agactaaagg attactcaaa acttcgagaa ggaagacaga tgaacttgtt 
tcttataaaa atgttgtctg aaactattcg cttttttcca gaaattttag aggaagaaaa 
"tTtgagg aaaaaggagt tactcaaata acaattcagg attaaaaata gacagttgag 
gIgcggaagg tataaattaa gtttgctact gtataatgga tttaaatcac tcagcagaca 
gaI™ tcttcgaaaa tctcttcaaa ccacgtcagc ttcgccttgc cgtagatcca 

176 
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A 



TAGGGGACTG GACTTCATCA GTTCTATCAA CAACCTCAAA ACAGTGTTTC GAGCTGACTT 
CGATCAATTT TATCTGCACC TCAAAGCTGT ACTTTGAGCA AGCCTGACGA CTAGCTTCCT 
ATTTGATTTT CATTGAATAT CAGAAACTCA TTCTCCATCA AATAATTCGA CTGCGTCTAA 
TAATTTTTGG TCTGGCACGG TGTCAGAAAT AAAGGTTGTG TATTTGGAGA GGGGATTAAT 
TTTAAAAAAT CCAGTCTTGT AAAATTTAGA ACTATCAATC AGTAAGATGG TTTCATGGGC 
TTTGTCAATA ATATTCTTTT TTGAAATAG 



1080 
1140 
1200 
1260 
1320 
1349 



(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1359 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY- linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 



CTTTTTTATA 

TCCGATTAAC 

GATTCATGAC 

CTTTCCACAC 

CAATATTAGT 

GTAAAAAATA 

CAGTATTACA 

CTCCATAACA 

TTGGAGACCA 

TCAAGCGAAC 

AGATTTTATT 

GTGGCATAAT 

AGGAACGACC 

CTGCCTCTGT 

ATGCATGGTT 

TAAAAGTTCT 

CTAGAATAGT 

GATTTGCCCT 

AGCTGTACGA 

CATATCACTA 

TAGGTGCTAT 

ACTTAGAGGC 

TTGGTTATAC 



AGCAATTCAA 

GATGGACTTT 

ATCTTCCAAA 

TTGTTCAATG 

CGGAATCTTT 

ATCATCTGGA 

TCTACCAGCT 

ATAGTGAAAT 

ACAAGATCCC 

CTTATGAAAA 

TTTAGACGCG 

ATTTCGCCAG 

TAACTCCTTT 

AGCCTTCAAA 

ACACCACCAC 

ATAATCTCTT 

ACACCTCTAC 

ATTCTTGTTT 

GAGTCTTTTA 

TTATATAATG 

GGATACACCG 

CCACATCAAG 

GACTTGGGGC 



TTTTATACTC 

ATCACCTCCT 

ATTCGAAAGG 

GGGTTCATCT 

AAGACACTTG 

TAAGCTTGTG 

ATTAAGAAAA 

TCTCGTATAT 

AGTTTACTGA 

CGTCTACTGG 

ACAATGGTTT 

CCATGGCGCT 

TTATAAGCCT 

TGGCGGGCAA 

TTTCCTGATA 

TATAAGATTT 

TTCTAAAACA 

CGTTTTACTA 

AAAGTGTTTT 

CTTTTTGATC 

GATGAGAATG 

GCCATGCGAG 

TGTATCGATC 



ATCTGCTTTC 

TCTCCAATCC 

CTTTATTCTT 

CTGGTGTGTA 

ATTTATGCCA 

AAAGCTCTTC 

AATGATTCCG 

AGTGACTATG 

TTCTACCGAA 

TTTAAATCGC 

GAGCGTCTGC 

TCAACAGTTG 

GAAATAAGGC 

GAAATACTTT 

GAGTTGTTCA 

GCCCATCAGA 

TTGTTAGAAA 

TAGAACGATT 

GATGGCTTGG 

TTTAGTTTAG 

GTTATGTAGC 

ATGCCATTTA 

CAGTTTCAG 



AAAAAGCATT 

TTGTATGACA 

AAATCCACGT 

TGGAGGAATA 

TATAGCATTG 

TAAAAAGGCG 

CCTGTTATGG 

GACATGTGGA 

ACCAGCCTCA 

TTTCTTGTCT 

TTTCTTAGGA 

ATAGAAGGCA 

ATCAATTGTA 

CTCTTCCTCA 

CATCTTATTT 

CGAAATATAA 

TCGATTTGAC 

TGAAGGCGTT 

ATTTCTTCTT 

TATTATCGTA 

AGATGACTAT 

CCNAGACGGG 



CTAGTCCATC 

TCTTGAAGTT 

TTACGAATCT 

AATGCAAAAC 

TCCATAACGA 

TTCATCCACT 

GCATCAACAG 

CCTACTCCTA 

TCTTGGTACA 

TCTTGAATTG 

TGTTCTGGAC 

TCACGTGTGT 

ACAAATTCTC 

ACTGTCATAT 

CAAACTACTA 

TAAATTGAAA 

TGTCCTGATC 

TATGATATTG 

TAGTTGATTT 

GAAAATGGAC 

CGGATTACTT 

GTTGACTTGC 
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(2) INFORMATION FOR SEQ ID NO: 2 12: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 



PTAHTATTTC 


CGTTGTGNTT 


AGTTTGTCTG 


CCTATGGAGT 


TATTGCCTAT 


GTGCAAGGTC 


60 


AGNTGGATAT 


TCTTCTAGTG 


ATTCTGGCCA 


TGATTGGTGG 


TTTGCTCAGT 


TTCTTCATCT 


120 




GCCTGCTAAG 


ATCTTTATGG 


GTGATGTGGG 


AAGTTTGGCT 


TTAGGTGGAA 


180 


TGCTGGCAGC 


TATCTCTATG 


GCTCTCCACC 


AAGAATGGAC 


TCTCTTGATT 


ATCGGAATTG 


240 


TGTATGTTTT 


TGAAACAACT 


TCTGTTATGA 


TGCAAGTCAG 


TTATTTCAAA 


CTGACAGGTG 


300 


GTAAACGTAT 


TTTCCGTATG 


ACGCCTGTAC 


ATCACCATTT 


TGAGCTTGGG 


GGATTGTCTG 


360 


GTAAAGGAAA 


TCCTTGGAGC 


GAGTGGAAGG 


TTGACTTCTT 


CTTTTGGGGA 


GTTGGGCTTC 


420 


TAGCAAGTCT 


CCTGACCCTA 


GCAATTTTAT 


ATTTGATGTA 


AG AATGGC AC 


CCTGATGTTT 


480 


TAGGGTGTTT 


TTGTGTTTAA 


ATACACAATG 


AAAATCAAAG 


AACAAACTAG 


AAAGCTAACT 


540 


TTAGGCTGCT 


CAAAACATAG 


TATATTGAAA 


CTAAAATAGT 


ACACCTCTAC 


TTCTAAAACA 


600 


TTGTTAGAAA 


TCGATTTGAC 


TGTCCTGATT 


ACGATTTATC 


CTGTTCTTAT 


TTCATTTTAC 


660 


TAT AC CATC A 


ATAAGTGTTG 


AAAAGATCGT 


TGACAGAACT 


G AC C AAGCC A 


GATCATCTTT 


720 


GTAACCACCT 


TGAGCTTGTA 


GCATATTTGT 


TAATTGAGCA 


AAGGCGCTAG 


AAGAGTTTGC 


780 


TGGGTC AGC A 


GTTGCGATTT 


TTCCTTTTAG 


TTCAGGTTTG 


AGAAGGTCGC 


TATATCCTTC 


840 


GATGTTCATG 


CCTTTAGTTA 


AATCAGGGTG 


GACGATTAAA 


ACACTACCAT 


CTAGTGTATA 


900 


AGGAGTAGAG 


AAGCCAGTTG 


TGTTTTGATA 


TTCTTTGATA 


ACATTATCAT 


TTTCTTTTGA 


960 


AATATAGTTT 


TCAAAGAGTT 


CTCCGTGGGT 


AGCATATTGT 


GTTATAAGAA 


CCACCAAAGA 


1020 


TAACATCAGC 


TACAGGAACT 


TCTTTTTCTG 


ACCTAGTTTT 


TTGAAAAGTT 


CTCCAGTACC 


1080 


AG 












1082 



(2) INFORMATION FOR SEQ ID NO: 213: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1303 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 

CTGGACTTGA TAGGCATCTT TGTAATCCTC TAAAGCCTCT TTCATCAAGG C ACT AC C AAT 

178 
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TCCTTGACGC 


TGATAGATTG 


GTAAAACGAT 


TAAATCCTGA 


AC CAATACTG 


ATGAGAATCC 


120 


ATCTCCAACC 


AAACGAATCA 


AGCCCACCAC 


AGCATCACCA 


TCAAGTGCCA 


CATAAATTAC 


180 


TAATGAATGA 


GATAAGGCCT 


GCTCCAGCAT 


CTCTGGTTGA 


TGGGTATAAT 


TAGTCCAACC 


240 


GACAGCCTGA 


TAGACATGTA 


AAACATCCTC 


TAGCTTGACA 


ATTTCTTGCT 


TTTTAATAGT 


300 


AATCATCTCA 


ACACCTCTTA 


AAGTTCTCTC 


AAGCTCTTGT 


ACTGCTGTCC 


ATTTTTATCA 


360 


AAATTTTCAG 


GACGCAACCA 


TGTTTCC AAA 


CGATCTTTGA 


CTTTGGGCCA 


GTCCTTATCA 


420 


ATCATAGACA 


ACCAATCCAT 


ATCTCTCGTA 


CGCCCCTTAT 


AAACCACTGC 


CTGACGGAAG 


480 


GTTCCTTCAT 


AAATAAAGCC 


CAAACGCTCC 


GCAGCACGTC 


TGGATGGCAG 


GTTAAGAGCA 


540 


TCGCATTTCC 


ACTCATAACG 


ACGATAGTTA 


AGCTCTTCAA 


AGACATAGCG 


AGCCAGTAGA 


600 


TACTGGGCTT 


CTGTCCCTAT 


CCGTGTCCCC 


CTGAGTTTTG 


GAGAAAAAGT 


GACAGCTCCC 


660 


ACTTCTATTA 


CTCCGTTACT 


GGTCAATACG 


CATGAGAGAA 


AAATTCCCAA 


AGCCTTACCA 


720 


GTTGCCTTGT 


CTATGTAGTA 


AAAACGGTCC 


TTACGAGCCA 


ACATCTGACT 


ACTATAGTAG 


780 


ATTGAAACTA 


GAATAATATA 


CCTCTACTTC 


TAAAATATTG 


TTAGAAATCG 


ATTTGACTTC 


840 


CCTGATCGAT 


TCGTCCTATT 


CTTATTTCAT 


TTTAATATAA 


TTGATAGTGG 


TCGCCCCAGC 


900 


CAGATACCTT 


ATCTGCTATC 


CATTTAGGAA 


CCCCTAACTT 


AAGCAATCCC 


CATAATCGTC 


960 


TCGATTTCTT 


CTTCCATTGC 


TTCCAGATAA 


TCACTCGTAG 


GCGAGTACGC 


AAGCGCTCAT 


1020 


CTATGTTAGT 


GACTATACTT 


TTCATATTTA 


TAATTCATTC 


CTTTCGTTTC 


ACTCAAGGCA 


1080 


CAACACAGAA 


TGAAAAAGTG 


TTGTGATCTT 


TATTTTGTTT 


TATAATAATA 


GTGAGAAAGC 


1140 


CTATC AC TAC 


TACAAATCAC 


GGGGAGGTGA 


ATAAGTGAGT 


GGTACAGCCA 


CTACCTCGCA 


1200 


TATTTTGTCA 


CATCATTTAA 


CGGTACATAA 


TAAGTTGTAC 


CATCTGAATA 


AGTTGCTACA 


1260 


ATATCATTTG 


CATGCTCTCC 


TTCACCTTTA 


GCAAAGGTTG 


GAG 




1303 



{2} INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1567 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

CTCTTCAATC AACTCACGCT TTAGGGCTTC CTGATGATTT TCACCTGCTT CAATTTCTCC 
ACATGGTAGG AACCAAGCAC CATTTGGTTC TTGAACAAGA ACAATTTGTT TTTGTTCAGG 
ATTAGGGATA ACTGCATATA CGCCATAGCG AGCAATATAG TCTGTATTCG CTTTTTTCTC 
CGAAAGTTGG GTTTGCCATT GCATTTTCCT CATTATCTAG TATCGTTATT ATTATAGTGA 
AATGAAC CAA AAATAGTACA CAATGTGGTA TAATCTTTTT ATGGCATATT CAATAGATTT 
TCGTAAAAAA GTTCTCTCTT ATTGTGAGCG AACAGGTAGT ATAACAGAAG CATCACACG^ 
TTTCCAAATC TCACGTAATA CCATTTATGG CTGGTTAAAG CTAAAAGAGA AAACAGGAGA 
GCTAAAC C AC CAAGTAAAAG GAACAAAACC AAGAAAAGTT GATAGAGATA GACTTAAAAA 
CTATCTTACT GACAATCCAG ACGCTTATTT GACTGAAATA GCTTCTGAAT TTGGCTGTCA 
TCCAACTACC ATCCACTATG CACTCAAAGC TATGGGCTAC ACTCGAAAAA AAGAACCACA 

179 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



WO 97/37CB6 

CCTACTATGA ACAAGACCCA GAAAAAGTAG 
AGCACCTAGC ACAGGTCTAA CAAATGGTGA 
GATGACGAGC GACTTTTTTG AAGTATGGTT 
ACCATCGGTT ATTATAGTGA AATGAAATAA 
TCGAATTCTA ACAATGTTTT AGAAGTAGAG 
TAATGGACAA TGCAAGATTC CATAGAATGG 
GGCATAAACT TTTACCTTTT CCTCCCTACT 
AGTACATAAC TGCTTCTAAA ACATTCTTAT 
TCAGTTCTTA TTTCATTTTG CTCTACAATC 
AGGTATTGGC AAGTTGCAAT ACCTTTTTAC 
TGACTATATC TCCTATGGTT CTAGTTCAGA 
GTATCATTCC AAGTATTGAG AGTGAATGTT 
CTGGCATTTG CTAGACCGCC ATCTTTACGA 
CGAAGACTGG CAGTAAGATT GGCGCCGTGT 
TCTTTTAATG ATTTGATAAA TTGGATGGTC 
CCGAACATTC GAGTGTCAAA TTGAGCAAGA 
TAAATAG 



PO7US97/0530<6 

CCTTATTTCT TAAGAATTTT ANTAGTTTAA 660 

ATTAATCGCT CCAATGACTT ACGAAGAGAT 720 

TCAGAAGTTT CTCTTACCAA CATTAACCAC 780 

GAACAGGACN AATCGATCAG GACAGTAAAA 84 0 

GTGTACTATT CTAGTTTCAA TCTACTATAT 900 

GTAAGCTAGA GTTCTTATGT GAAGAGTTTG 9 60 

CATCTTAGTA TAGAAAAGTG AATCTAAAAT 102 0 

AAATTGATTT AAATTCTCAA ATCATATTAT 1080 

CTGTTGAGAA GACACGTGTT CATATCAAAA 1140 

AAGGTTCTTT TGTCTTATTT TTGTTTCAAC 1200 

AGGCTAGGCT ATAATTATGA TTGATAAGAA 12 60 

TCAAAATCAT GGGTTTCTAT AATGGTCAGG 132 0 

AGAAGTGGTT CTTTATAGCC TAGGAGAGTA 13 80 

CCGACAATTA GAATACGCTC AGCTGGACTA 1440 

CGCTGAGTTG TACTATAGAG GGATTCGGCT 1500 

TTTGAACGAA AAGCCTGGAT TTGTTGCGGG 15 60 

1567 



(2) INFORMATION FOR SEQ ID NO-.215: 

(i) SEQUENCE CHARACTERISTICS: 
{A> LENGTH: 4 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 15: 



CTGTTTCCAT 


AGCATGACTT 


CTGT AC TAG A 


CTTTCTTTTC 


CGAATAAATA 


GATAGAACCA 


60 


CAGAATCTAG 


TAAACCTAGA 


ATTAAAATTA 


TGGTATAATA 


TTAGCAATAA 


AAGAAATCTG 


120 


GAGGATTAGA 


ATCATGGTAT 


CAACGAAAAC 


ACAAATTGCT 


GGTTTTGAGT 


TTGACAATTG 


180 


CTTGATGAAT 


GCAGCAGGTG 


TGGCTTGTAT 


GACGATAGAG 


GAGTTAGAAG 


AGGTCAAAAA 


240 


CTCAGCGGCA 


GGAACCTTTG 


TTACTAAGAC 


AGCGACCTTG 


GACTTCCGTC 


AGGGGAATCC 


300 


TGAGCCACGC 


TACCAAGATG 


TTCCACTTGG 


TTCCATCAAC 


TCTATGGGCT 


TGCCAAATAA 


360 


TGGCTTAGAC 


TATTATTTGG 


ATTATCTTTT 


AGATTTGCAG 


GAAAAAGAGT 


CGAACCGAAC 


420 


TTTCTTCTTA 


TCTCTGGTCG 


GCATGTCTCC 


AGAGAAACCC 


ATACTATTTT 


GAAAAAAGTC 


480 


CAGAGA 












486 



(2) INFORMATION FOR SEQ ID NO: 21 6: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 814 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:216: 



CTCTATCAGG 


AAAGTTAAAT 


TAATTTATAG 


AAATATTTTA 


GTATTTAAGC 


CCTACTGTTA 


60 


TAGATTCAAT 


ACACTATATA 


TGTGTTTGTC 


TGATAAAAAT 


TTCTACTCTT 


TTTGATTTTA 


120 


AATAAGTATT 


AGTTTACATT 


ATGGTGTGAA 


TTGGGTTTGA 


TATCTCTTTT 


GAGGAAGTTG 


180 


CCTTAGATTT 


TTCTGATTGT 


GTTTTATTGT 


ACAGTGTATC 


TTGCTTGTTT 


TGAACAGAAT 


240 


TTTTTATGAC 


ATTTGTCATA 


TTTTCTAGTG 


ACAGAAGCTT 


CTGCCTCCTC 


TGATTTTAAA 


300 


AGACTATAAT 


TGTAGTATGA 


AATGGGGGAA 


GAAGAGATGA 


GAAATAAAAT 


GATTATCGCA 


360 


GTAAGTTTAG 


TAGTAGCAGG 


AGTTATGACC 


TATCTCATGT 


TTTCGGGATT 


GGATGAGAAT 


420 


TTCTACCATT 


TTCCTTGGGA 


GGTCTTTGCT 


GGCTTTGGAA 


TCATCTTCTT 


GGCTTGTCCA 


480 


GAGAAGGTTT 


GAAATTAGTA 


AGAGATGTGA 


AAAAGGAGTT 


TGAAAAATGA 


AAAAAGCAAC 


540 


TATCTATTTC 


TTTATCGGCC 


TGTCACTCTT 


GGTATGGTTG 


GTAGAAATGT 


TTACAGGTTG 


600 


GTTTGCTCAA 


ACCTTCCTTC 


ACCAGTTCAT 


CCGTGGTGCA 


TGGGGATTAG 


GATTTATGAT 


660 


TTTTATCGCC 


TTTCCGATGG 


GAAAGGAGTT 


GCTGGAAGGA 


GAATATCATG 


AACATGATTA 


720 


AGGTTCAAGG 


ACTACATAAA 


AATATTAAGG 


GCAAGGCTAT 


TTCAAAGGAT 


ATTTCCTTTG 


780 


AAGTAGCAGA 


AGGTGAATGC 


GTTGCCTTGA 


TTGG 






814 



(2) INFORMATION FOR SEQ ID NO: 217: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1319 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 



CTTGGTACGA 


CCGTATGGGT 


GGTCACTGAA 


AGTGGGAAAC 


TTCCAAGATG 


GGCACTGTGT 


60 


ACGGATCCCC 


GTAAACTGTC 


GCANAAGAAC 


TGAAGATGAT 


GTTTTTAC AG 


TTGTTTTCTT 


120 


CCATGGCTTT 


CAAAAGGCTG 


ACAGTTCCAG 


CGATATTGTT 


GTCATAGTAG 


GCAAGAGGGA 


180 


TACGTGTTGA 


TTCGCCAACA 


GCCTTCAAAC 


CAGCAAAGTG 


AATGACACCA 


GTCAGTTCTT 


240 


CCTGCTTGAA 


AATATCTCTG 


AGGGTATCTG 


TGTCACNAAT 


ATCTGCCTCA 


TAGAAAGGAA 


300 


TCTCAACTCC 


TGTGATTCTT 


TCAACAACTT 


CTAAACTCTT 


ACNATTGCTA 


TTGACAAGAT 


360 


TATCCACCAC 


AACAACTTGA 


TGGCCTGCTT 


GGATCAATTC 


AATAACAGTG 


TGGGTTCCAA 


420 


TAAAACCGGC 


ACCACCAGTT 


ACCAAAATCT 


TTTCTTGCAT 


CTTTTTTCCT 


CGATTCTCAG 


480 


ATTATTTTTT 


CTTATTTTAA 


CCATTTTTGA 


CAGGGAATGT 


C ATTTGC C AC 


CTTAAACTAC 


540 
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CTGATAAAAT TTCAGTAAAA TGCTTATACT 
GTCGCCTTGC CATGGGTATG GTTACTGACT 
CGCTTTGAGT AACCCGCGGC TAGTTTCNTA 
TCGCTTTTTA CTCGTTTGAC ATAGTTTTCA 
TCCTTGTCTT GGATCAGTTG AGCTAAATGG 
GATGAACCTA GTCCACTGGC TGCATAGACA 
GGAGAGAAAT CACTGGTGTA GGCACGGATT 
GCCAAAATCA GATAGTGAGT CAAGGTGGCC 
ACCGTCAAAT CAAATCCCAT GTCATTTTCG 
GCAAAAGGAA TCAAATCCCA CTCCCCTTCT 
TGGACCAGCT GATAATCTCG TAGTTGTCCT 
AAGGGTTCTA ACATGTCCCC CAACCAAGCT 
TCTTCACCAA TCTGGTAGCC TGATGCTAAC 



PCTAJS97/05306 



CTTGGAAAAT 


CCAATTCAAA 


CCACGTCAAC 


600 


TCGTCAGTTC 


TATCCACAAC 


CTCAAAGCAG 


660 


GTTTGTTCTT 


TGATTTTCAT 


TGAGTATTAT 


720 


ATTGGGTAGT 


TTTCAGGGTC 


CAAGGTCAAC 


780 


TAACCAATAA 


TAGGACCAGT 


TGTGAGGCCT 


840 


CCAGTTAAGT 


CAGGCACCTG 


CCCAAAGAAA 


900 


CCAACACGCT 


CAGATTTTGA 


AGTAGC TTC A 


960 


TCCTCCATTT 


GTTGGAGCAA 


GGTTTCATCT 


1020 


TGGGTAGCGC 


CTAAGGACAA 


TTTCCCACCT 


1080 


GGCATGACAA 


CAGGGTAAGC 


TTCCATGTCT 


1140 


TTTTGAGGAC 


GAACATCTAC 


TTCATAACCC 


1200 


CCNGTCGCCA 


AAATAACCTG 


CTCAAACTCC 


1260 


GGTGTCAGAG 


TCACTTTTTC 


TTTGACCAG 


1319 



(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 



CTCAAAATAC 


TGTTTTGAGG 


TTGCAGATGG 


AAGCTGACGC 


GGTTTAAAGA 


GATTTTCGAA 


60 


GAGTATAAAC 


TGCTTATAAA 


ATAAAAAGGA 


GCCCTGATGG 


AACACATTAT 


TTATCAGTTT 


120 


GAAGAGGACT 


TGGCAATCCT 


TACCTTGAAC 


CGTCCTGAGG 


TCGCAAATGG 


TTTTCATATT 


180 


CCCATGTGTG 


AGGAGATTTT 


AGAAGCTCTG 


ACTTTGGCAG 


AAGAAAATCC 


AGCTGTGCAT 


240 


TTTATCTTAA 


TCAATGCCAA 


TGGAAAGGTC 


TTCTCAGTTG 


GGGGAGATTT 


GGTAGAGATG 


300 


AAGCGGGCAG 


TGGATGAGGA 


TGATATTCCA 


TCATTGACAA 


AAATCGCAGA 


GTTGGTCAAT 


360 


ACGATTTCTT 


ATAAAATCAA 


GCAAATAGCC 


AAACCTGTTT 


TTAATGGAAG 


TTGACGGTGC 


420 


TGTTGCAGGT 


GCCGCAGCGA 


ATATGGCTGT 


TGCGGCAGAT 


TTCTGTTTGG 


CGACGGATAA 


480 


GGCTAAGTTT 


ATCCCAAGCC 


CTTTGTTGGT 


GTGGTTGGCT 


CCAGATTCCA 


GGGG 


534 



(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 784 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) 



MOLECULE TYPE: Genomic 



DNA 



PCT/US97/05306 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:219: 



CTGCAGCCGG 


AAGTTATCTG 


GTTACAAAAA 


ATCGCCAAAC 


CATCACAGAT 


GAAAGTCTTG 


60 


AATACCACTG 


ACCGCGTTCA 


AGCTATCAAG 


GACGATGTGG 


ATATTATC C A 


AAACAGCCTG 


120 


CAAATCATTA 


ACCAGCAAAA 


AGAACTTATC 


AAGGAATACC 


AAGAAGACTT 


GACTTACAAG 


180 


TTTAAGGTCT 


TGGAAAAGGA 


TATCCAAACT 


AGGACTAGCT 


GTGATAAAAG 


AAATGCAGGG 


240 


AACTGAAGAT 


AAGTAAAAAG 


AGCCCGACGG 


CTCTTTTTTA 


AAACGTAAAC 


AAGTTATAGT 


300 


GAATTGAATT 


TAGAATAGCA 


CATCATAGTT 


TCTAAAGCAT 


TGTTAGAGAT 


TACTTTAAAA 


360 


TCCTCTTATC 


AATTTGTTCA 


TATTCTATTT 


CAATC TACTA 


TATATATCCC 


ATCAACTATG 


420 


CATCATAATT 


TAGGTAACTC 


ATACTCAATA 


AAAATCAAAA 


AGCAAACTAG 


AAAGCTAGGC 


480 


ACAGACTGCT 


CAAAACACCG 


TTTTAAGGTT 


GTGGATAGAA 


CTGACGAAGT 


CAGTAACCAT 


540 


ACCCATGACA 


AGCCGACGCT 


GACGTGGTTT 


GAAGAAATTT 


TTGAAGAGTA 


TCATTCACCA 


600 


TTTCACAACT 


AAATCGAAGA 


CTTTCTGCCG 


TAATGAAAAC 


ACCTGAAACA 


GCTTGGTTTC 


660 


AGCTGTCCGG 


AAACTTTAAG 


ACTTAGGTTC 


AAAGTTTAGG 


TATGGAACTT 


CGAAGGAGGT 


720 


CGCTACCGTC 


CGTCATTACT 


TAGNGAAGNC 


TTAAAAAATC 


TATAAAGTAA 


AAAGAGCCCG 


780 


ACAG 












784 



(2) INFORMATION FOR. SEQ ID NO: 220: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 962 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

CTACTCGTTT TAGTATAGTA AACTGAATCT AGACTAGCAC AATGCNGCTT CTANAGCGTT 60 

TCGAAACTAG AATAGGACAA CATAACTGCT AAAAGATTTC TATAAATTCA TTTGATTTTC 120 

CTAATCAATT TGTTCGTATC CTANTTCACT CCACTATAAA ATATTCTTAT CAATTGATTT 180 

GNATGCCAAA ATTCCATCGT TCAGGTCTTA TTTCATTNGG CTNTAAAAAA CTCTCCTGGG 240 

NAATCCCAGG AGAGNATGAT TACNTATTTG AACTTGAATC CTTCGTANAT AAGCTCTGTT 3 00 

TTTGGATTTT GTTTCTTAAT CTGTTTGGCA AGTGCCTTCA TCATAGAAAT AGGACCACAC 360 

ATATAGACGG TTGCATGTTC GGGCACTTTT TTTGTTCAAA ATTAAGATAG CCGTCTTTCG 420 

TACTGTCGAT TAGATGGAGT TCAAAATTAG GATTTTTCTG AGCATAGTTA CGGAGTAAAT 4 80 

CTAGGTAGAC TGCATTTTCA TCTCCACGGA AGCTATAGTA GAAGTGAACC TGTTTATCTA 540 

AAATAGGATG TTCACGGATG TAAGAGATGA AGGGGGTGAT CCCAATACCT CCAGCAATCC 600 

AAACCTGATT TTCTCGTCCT TCTTCTATGA TCATGTGTCC GTAAGCTCTG TCTAGGGTTA 660 

CTTTGCTGCC GGCTTGAAGA TTATCATAGA TATTTCTTGG TATGGTCGCC TGAATTTTTA 720 

ACAGTAAAGT TAAAGAGTTT GAACCCATGA ACTCCCTGAG AATAGAAAAG GGATGCCGGA 780 

183 




ACCACTTTCA AAGCCTTCTT GG AAAATGTT TAGAAAGGCA AATTGTCCTG ATTGATAGTT 
GAAAGGTCTG CTAAGATGGA TTTGAATTTC TCTAGTATCG TGATTTAAGC GTTTGAGATG 
GGTAATTTTC CCTAGATAGG GGAAGGAAAT CTTTTGATAT AGAAAAATGA TATAAAAACC 



840 



900 



960 



AG 



962 



(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 



CTCTTTCCAC 


TATTATAACA 


CGAAAGAAAG 


AATTGTCAGG 


GAACTGTACA 


GATTTCTTTT 


60 


CTATCTATTT 


ATAACTCAAT 


GAAAATCAAA 


GAGCAAACTA 


GGAAACTAGC 


CGCAGGCTGT 


120 


AATTGAGTAC 


GGCAAGGCGA 


CGTTGACGCG 


ATTTGAATTT 


GATTTTCGCA 


GAGTATTATT 


180 


CGTAAAAAAT 


CTCAAAAAGC 


CTACCTTTCG 


GTAGACTTAG 


TTTGTTTCTA 


TTCTAATCGG 


240 


CACTCTTCCA 


AAATTTTGCT 


CTGCTATACT 


TGGCTTTCCT 


AGTTGGTAAA 


TCTGGTCAAC 


300 


CTTTTGAGTC 


ATAGCATCCC 


AAGGTTCTTT 


GCCAATTCGG 


CTGACTAGAT 


TGACCTGTCC 


360 


TTTCAGAGAC 


TTGAGATGTT 


GCCTGCCTTT 


TTCAGTAAAT 


CCAAGGACAT 


GAATGGC TTC 


420 


TGGCAAGTCA 


CTTTCTCTAG 


CCTGCATCAA 


AATATAGGTC 


AATAGGCGTC 


TGACACGCGC 


480 


CTTGGTGTAA 


CGTTTGGTAG 


TCACTAACTC 


GACCAATTCT 


TC C AC AG ACT 


GGGC TGTTTT 


540 


AATAGCTTCT 


TTAATGCGCA 


CAGCCATTTC 


TTGATTGACC 


TGATAGATGG 


TGGTTAGGTC 


600 


GGGATTTGAC 


AAGATTTGAT 


AGCGGAGCAA 


GGGAAAATAG 


TCTTCCCAGA 


TCACCTTACT 


660 


GGCTTGCTCA 


AATAGGGCAA 


CAGAAGGCAT 


AAAACGTTCT 


AAGAAATCTT 


GGTCCTTCTG 


720 


ATGTTGACGG 


AGGGCTGTCG 


CCGAGGCAAA 


GTCCACATCT 


TTATTCACAG 


AATGGTAACC 


780 


TGCCCCCTGA 


CGCTGAATCG 


GATGCAGCTT 


GATATTCCGT 


CCTGCAACCG 


CCTTGGCATA 


840 


GGCCAAAACA 


AAGAACATGA 


TTGGGGTGTA 


TTACCTGAAA 


AATCAAGACC 


AGCAAATTCC 


900 


TTCCACATAG 
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(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Genomic DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 



PCT/US97/05306 



60 



(2) INFORMATION FOR SEQ ID NO: 22 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:223: 



CGCCAAGTCA TCCAACAAGT CCAGAACGGT TCTGGGGCTT GGGATCCTCA GTCGCTGGAG 

ATAACTCCTT TGGGCTTGTT CATCATGTAG TAGACAAACT CTTCATACTC CAACACTTGC 120 

CCATCAAAAG CGAATCTCAT CTATTTTTTC ATCAATCTGC AATTTAGCTG ATTTTTCTTT 180 

TTTACCATTT ACAGTCACGC GCCCAGCCTT GAGCAAGTTT TTGACCTCAG TCCGACTTCC 240 

CACCGCACAG GCAACTAAAA ATTTATCTAA TCTCATAGAA CTATTATATC ATATCAAAAG 3 00 

GAGGCTGGTA CAATGACCAA CCTCCTTTTC GTTTCATACT CTTCAAAAAT CTCTTCAAAC 3 60 

CGCGTCAACG TCGCCTTGCC GTATATATGT TACTGACTTC GTCAGTTCTA TCTGCAACCT 420 

CAAAGCAGTG CTTTGAGCAA CCTGCGGCTA GTTTCCTAGT TTGCTCTTTG ATTTTCATTG 480 

AGTATCAGAT TTAGGAAATT AACTTCCTCG TCTCCAAAAA ATAGCTAAGA CAATCATGGC 540 

ACCTAAAACA GCTGGGATAA TAGCTGTTCC TGATAAAACT GGCCCCCAAG TTCCAAAGAG 600 

CAAGTGACCT AGAAAGGCTC CGATCCAACC GAGAAACATT TTTCCAAAAC ATCCCATTCG 6 60 

CTCTCCACGA TTGGTCATAG CACCTGCTAA AAATCCCACT AGGAGACCAA CGAACATACT 720 

TCCTAACATA TTATCTCCTT AATTTGCCCA ATTCCCATTA CGGAAAAGAA GTACTCGCGT 780 

TCCATCCTCA CGAATACCAT CGATATCCAT TTGGTTAGAA CCAATCATAA AGTATACGTG 840 
AACATCTGAA CGGTTAAGCC CTGCAG 



866 



CTTCAGGATG 


TGGGAAAACG 


ACCCTTCTTC 


GTATGATTGC 


AGGTTTCAAC 


AGTATCAAAG 


60 


ATGGAGAATT 


TTACTTCGAT 


GATACAAAAA 


TCAATAATAT 


GGAACCCAGC 


AAACGCAATA 


120 


TCGGGCTGGT 


TTTCCAAAAC 


TACGCTATTT 


TCCCACATTT 


GACTGTCCGA 


GACAACGTTG 


180 


CTTTTGGTCT 


TATGCAAAAG 


AAGGTTCCAA 


AAGAAGAATT 


GATTCAACAG 


ACCAACAAGT 


240 


ATCTTGG AC T 


CATGCAAATT 


GCTCAATATG 


CGGATCGAAA 


GCCCGATAAA 


CTCAGTGGTG 


300 


GACAACAACA 


ACGTGTCACC 


TTGGCATGCG 


TCTTAGCGGT 


TAATCCAAGT 


GTTCTCCTCC 


360 


ATGGACGAAG 


CCACTTAGTA 


ATCTGGAGGC 


CAAACTTCGC 


TTGGATATGC 


GTTCAAGCCC 


420 


ATCCCGAGAA 


ATCCAACCAC 


GAAGTTGGGG 


AATTACAACT 


GTTTATGTAA 


CCCACGACCA 


480 


AGAAGGAGCC 


ATGGCTATTT 


CAGACCAAAT 


TGCCTGTTAT 


GAAAGATGGG 


GTGATCCAAC 


540 


AAATCGGCCG 


ACCAAAAGAA 


CTGTATCATA 


AACCAGCTAA 


TGAGTTAGTG 


GCAACCTTTA 


600 


TCGGACGCAC 


AAATATTATC 


CCTGCCAATC 


TTGAAAAACG 


GAGCGACGGC 


GNTTATATCG 


660 


TCTNTTCAGA 


TGGANANGCC 


CTTCGAATGA 


TAG 






693 



(2) INFORMATION FOR SEQ ID NO: 224: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 24: 

Met Ser Met Phe Thr Met Val Lys Met Asn Pro Leu Arg Gly Leu lie 

15 10 15 

Cys Asn Leu Lys Gly Asn Lys lie Thr Ala Leu lie Gly Pro Ser Gly 

20 25 30 

Ser Gly Lys Ser Thr Tyr Leu Arg Ser Leu Asn Arg Met Asn Asp Thr 

35 40 45 

lie Asp lie Ala Lys Val Thr Gly Gin lie Leu Tyr Arg Gly lie Asp 

50 55 60 

Val Asn Arg Pro Glu He Asn Val Tyr Glu Met Arg Lys His He Gly 
65 70 75 80 

Met Val Phe Gin Arg Pro Asn Pro Phe Ala Lys Ser He Tyr Arg Asn 

85 90 95 

He Thr Phe Ala His Glu Arg Ala Gly Val Lys Asp Lys Gin Val Leu 

100 105 110 

Asp Glu He Val Glu Thr Ser Leu Ser Gin Ala Ala Leu Trp Asp Gin 

115 120 125 

Val Lys Asp Asp Leu His Lys Ser Ala Leu Thr Leu Ser Gly Gly Gin 

130 135 140 

Gin Gin Arg Leu Cys He Ala Arg Ala He Ser Val Lys Pro Asp He 
145 150 155 160 

Leu Leu Met Asp Glu Pro Ser Leu Ser Leu Gly Ser Asp Cys Asp His 

165 170 175 

Ala Thr Arg Arg Asp His Val 
180 



(2) INFORMATION FOR SEQ ID NO: 225: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 247 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



PCT/USS>7/®S306 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 

Met lie Lys lie Ser Asn Leu Ser Lys Ser Phe Ser Gly Gin Thr Val 

1 5 io is 

Leu Asp His Leu Asn Leu Asp He Gin Lys Gly Glu Val Val Ala Leu 

20 25 30 

lie Gly Ser Ser Gly Ala Gly Lys Ser Thr Phe Leu Arg Ser Leu Asn 

35 40 45 

Tyr Leu Glu Thr Pro Asp Ser Gly Ser He Gin He Asp Gly Phe Ser 

50 55 60 

Val Asp Phe Ser Lys He Thr Gin Glu Glu He Leu Ala Leu Arg Arg 
65 ™ 75 ~ go 

Lys Leu Ser Met Val Phe Gin Gin Phe Asn Leu Phe Glu Arg Arg Thr 

85 go 95 

Ala Leu Asp Asn Val Lys Glu Gly Leu Val Val Val Lys Lys Leu Ser 

100 io5 no 

Asp Gin Glu Ala Thr Lys He Ala Lys Glu Glu Leu Ala Lys Val Gly 

115 120 i 2 5 

Leu Ser Asp Arg Glu Asn His Tyr Pro Arg His Leu Ser Gly Gly Gin 

130 135 140 

Lys Gin Arg Val Ala Leu Ala Arg Ala Leu Ala Met Lys Pro Asp Val 

145 150 i cc 

X3U 155 

Leu Leu Leu Asp Glu Pro Thr Ser Ala Leu Asp Pro Glu Leu Val Gly 

16 5 170 175 

Glu Val Glu Lys Ser He Ala Asp Ala Ala Lys Ser Gly Gin Thr Met 

180 185 i 90 

lie Leu Val Ser His Asp Met Pro Phe Val Ala Gin Val Ala Asp Lys 

195 200 205 • 

lie Leu Phe Leu Asp Lys Gly Lys He He Glu Ser Gly Thr Pro Asp 

210 215 220 

Glu He He His Thr Pro Lys Glu Glu Arg Thr Lys Glu Phe Phe Thr 

225 230 235 240 

Ser Tyr Lys Arg Thr Tyr He 

245 

(2) INFORMATION FOR SEQ ID NO:226: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 243 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 
.(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 6: 



Met Phe Ser Leu Arg Ser Val Phe Asp Gly lie Pro Arg lie Val Gin 

1*5 10 15 

Gin Leu Pro Thr Thr lie Met Leu Thr lie Gly Gly Ala Leu Phe Gly 

20 25 30 

Leu Val Leu Ala Leu Leu Phe Ala lie Val Lys lie Asn Arg Val Lys 

35 40 45 

He Leu Tyr Pro Leu Gin Ala Phe Phe Val Ser Phe Leu Lys Gly Thr 

50 55 60 

Pro He Leu Val Gin Leu Met Leu Thr Tyr Tyr Gly He Pro Leu Ala 
65 70 75 80 

Leu Lys Ala Leu Asn Gin Gin Trp Gly Thr Gly Leu Asn He Asn Ala 

85 90 95 

He Pro Ala Ala Ala Phe Ala He Val Ala Phe Ala Phe Asn Glu Ala 

100 105 110 

Ala Tyr Ala Ser Glu Thr lie Arg Ala Ala He Leu Ser Val Asn Pro 

115 120 125 

Gly Glu He Glu Ala Ala Arg Ser Leu Gly Met Thr Arg Ala Gin Val 

130 135 140 

Tyr Arg Arg Val He He Pro Asn Ala Ala Val Val Ala Thr Pro Thr 
145 150 155 160 

Leu He Asn Ser Leu He Gly Leu Thr Lys Gly Thr Ser Leu Ala Phe 

165 170 175 

Ser Ala Gly Val Val Glu Val Phe Ala Gin Ala Gin He Leu Gly Gly 

180 185 190 

Ala Asp Tyr Arg Tyr Phe Glu Arg Phe He Ser Val Ala Leu Val Tyr 

195 200 205 

Trp Val Val Asn He Gly He Glu Ser Leu Gly Arg Phe He Glu Arg 

210 215 220 

Lys Met Ala He Ser Ala Pro Asp Thr Val Gin Thr Asp Val Lys Gly 
225 230 235 240 

Asp Leu Arg 



(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids . 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single. 
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(D) TOPOLOGY: linear 



PCT/US97/0S306 



(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227 



Met Tyr Arg He Asp Asp Asp Glu Gin Phe Val Leu Asp Phe Leu Lys 

15 10 15 

Gin Glu Lys Val Leu Leu Val His Gly Arg Gly Phe Asn Trp Gin Glu 

20 25 30 

Pro Asp His Phe Arg He Val Tyr Leu Pro Arg Val Asp Glu Leu Ala 

35 40 45 

Gin He Gin Glu Lys Met Thr Arg Phe Leu Lys Gin Tyr Arg Arg 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:228: 
Met Asn Asp Glu Ala Ser Lys Gin Leu Thr Asp Ala Arg Phe Lys Arg 

1 5 io 1S 

Leu Val Gly Val Gin Arg Thr Thr Phe Glu Glu He Leu Ala Val Leu 

20 25 30 

Lys Thr Ala Tyr Gin Leu Lys His Ala Lys Gly Gly Arg Lys Pro Lys 

35 40 45 

Leu Ser Leu Glu Asp Leu Leu Met Ala Thr Leu Gin Tyr Val Arg Glu 

50 55 ' 60 

Tyr Arg Thr Tyr Glu Gin He Ala Ala Asp Phe Gly He His Glu Ser 



65 



70 



75 



80 



Asn Leu lie Arg Arg Ser Gin Trp Val Glu Val Thr Leu Val Gin Ser 

85 90 95 

Gly Val Thr He Ser Arg Thr Pro Leu Ser Ser Glu Asp Thr Val Met 



100 105 



110 



He Asp Ala Thr Glu Val Gin He Asn Arg Pro Lys Lys Arg He Ser 
115 120 125 
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Glu Ser Phe Trp 
130 



JPCT/US97/05306 



(2) INFORMATION FOR SEQ ID NO: 22 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 

(.C) STRANDEDNESS : single 
(D). TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 



Met Arg Arg Lys Tyr Lys Ser lie Ala Leu Lys Lys Glu Leu Ala Asn 

15 10 15 

His Ser Gly Lys Lys Lys Phe His Ala Met Lys Ala Gin Ala lie Val 

20 25 30 

Thr Ser Gin Gly Arg He Val Ser Leu Asp He Ala Val Asn Tyr Ser 

35 40 . 45 

His Asp Met Lys Leu Phe Lys Met Ser Cys Arg Asn He Gly Gin Ala 

50 55 60 

Gly Lys He Leu Ala Asp Ser Gly Tyr Gin Gly Pro Met Lys He Tyr 
65 70 75 .80 

Pro Gin Ala Gin Thr Pro Arg Lys Ser Ser Lys Leu Lys Pro Leu He 

85 90 95 

Ala Glu Asp Lys Ala Tyr Asn His Ala Leu Ser Lys Glu Arg Ser Lys 

100 105 110 

Val Glu Asn He Phe Ala Lys Val Lys Thr Phe Lys Met Phe Ser Thr 

115 120 125 

Thr Tyr Arg Asn His Arg Lys Arg Phe Gly Leu Arg Met Asn Leu lie 

130 135 140 

Ala Gly He He Asn Tyr Glu Leu Gly Phe 
145 150 



(2) INFORMATION FOR SEQ ID NO: 23 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 

Met Val Ser Ser Ser Gly Ser Glu Phe Gin Ser Gly Tip Gin Glu His 

15 io 15 

Gin Leu lie Ala Glu Lys Val Ser Lys Thr Leu Asp Lys Thr Phe Asp 

20 25 30 

Lys Asp Val Arg Lys He Pro Thr Ser Pro Val Leu Ser Lys He Cys 
35 . 40 45 

Arg 



(2) INFORMATION FOR SEQ ID NO: 231: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

Met He Lys lie Leu Ala Ala Cys Gly Ala Gly Val Asn Ser Ser His 

1 5 10 15 

Gin He Lys Ser Ala Leu Glu Glu Glu Leu Ser Asn Arg Gly Tyr Asp 

20 25 30 

Val His Cys Asp Ala Val Met Val Lys Asp Val Asn Glu Asp Leu Met 

35 40 45 

Lys Gly Tyr Asp He Phe Thr Pro He Ala Ala Thr Asp Leu Gly Phe 

50 55 60 

Glu Pro Gly He Pro Val He Glu Ala Gly Pro He Leu Phe Arg He 
65 7 0 75 ~ 80 

Pro Ala Met Ser Ala Pro Val Phe Asp Asn He Arg Leu Pro Ala Lys 
85 90 . 95 

Gin Asn Met Val 
100 

(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 49 amino acids 
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(B) TYPE; amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



PCT/US97/053O6 



(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 

Met Asn Thr Lys Met Met Ser Gin Phe Ser Val Met Asp Asn Glu Met 

1 5 10 15 

Leu Ala Cys Val Glu Gly Gly Asp lie Asp Trp Gly Arg Glu lie Ser 

20 25 30 

Cys Ala Ala Gly Val Ala Tyr Gly Ala lie Asp Gly Cys Ala Thr Thr 
35 40 45 

Val 



(2) INFORMATION FOR SEQ ID NO: 2 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single. 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 3: 



Met Arg Glu Lys Glu lie Phe Asp Ser lie Val Thr lie lie Gin Glu 

1 5 10 15 

Arg Gin Gly Glu Asp Phe Val Val Thr Glu Ser Leu Ser Leu Lys Asp 

20 25 30 

Asp Leu Asp Ala Asp Ser Val Asp Leu Met Glu Phe lie Leu Thr Leu 

35 40 45 

Glu Asp Glu Phe Ser lie Glu lie Ser Asp Glu Glu lie Asp Gin Leu 

50 55 60 

Gin Ser Val Gly Asp Val Val Lys lie He Gin Gly Lys 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 
Met Ala His Gly Asp Leu Leu Tyr His Asp Gly Leu Phe Phe Ser Ala 

Lys Lys Glu Asp Gly Thr Tyr Asp Phe His Glu Asn Phe Glu Tyr Val 

20 25 30 

Thr Pro Trp Leu Lys Gin Val Asp 
35 40 

(2) INFORMATION FOR SEQ ID NO: 23 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi)' SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

Met Ser Leu lie Thr His Arg Arg Phe He Ser Cys Asn Glu Asn He 

1 5 10 15 

Lys His Tyr Lys Arg Leu He Asp Lys Ala Glu Lys Cys Val Asn Asn 

20 25 30 

Leu Met Ala Glu Leu Asn Ser Val He Thr Thr Val Thr Gly He Glu 

35 40 45 

Asn Arg Leu Gly Ala Val He Leu Ala Glu lie Arg Asn Ile His Ala 

50 55 60 

Phe Asp Asn Pro Ala Gin Leu Gin Ala Phe Ala Gly Leu Asp Ser Ser 
" 70 7 5 80 

He Tyr Gin Ser Gly Gin lie Asp Leu Val Gly Arg Met Val Lys Arg 

85 9 ° 95 

Gly Ser Leu His. Leu Arg 

100 

(2) INFORMATION FOR SEQ ID NO: 23 6: 
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( i ) SEQUENCE CHARACTERISTICS : 
<A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 6: 

Met lie Arg Ala Val Phe Phe Ser lie Asp Asn Asp Met Glu Glu Ala 

15 10 15 

Ala Arg Ser Met Gly Ala Ser Ser Phe Tyr Thr Met Val Arg Val lie 

20 25 30 

lie Pro Tyr He Leu Pro Val Val Leu Ser Val Val Val Leu Asn Phe 

35 40 45 

Asn Ser Leu Leu Ser Asp Tyr Asp Leu Ser Val Phe Leu Tyr His Pro 

50 55 60 

Leu Phe Gin Pro Leu Gly He Val He Lys Gin Ser Thr Asp Glu Thr 
65 70 75 80 

Ala Thr Leu Asn Ala Gin Ala Met Met Phe Val Tyr Ser Val He Leu 

85 90 95 

Met He Met Ser Ser Ala Ala Leu Tyr Leu Ser Ser Leu Phe Gin Gly 

100 105 110 

Lys Arg Gly Lys Arg 
115 



(2) INFORMATION FOR SEQ ID NO: 23 7: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 7: 

Met Glu Lys Cys Tyr Thr Asp Val Thr Glu Phe Ala He Pro Ala Ser 

15 10 15 

Thr Gin Lys Leu Tyr Leu Ser Pro Val Leu Asp Gly Phe Asn Ser Glu 
20 25 30 
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He He Ala Tyr Asn Leu Ser Thr Ser Pro Asn Leu Glu Gin Val Gin 

35 40 45 

Thr Met Leu Glu Gin Ala Phe Thr Glu Lys His Tyr Glu Asn Thr He 

50 55 60 

Leu His Ser Asp Gin Gly Trp Gin Tyr Gin His Asp Ser Tyr His Arg 
65 70 75 80 

Phe Leu Arg Val Arg Glu Phe Lys His Leu Cys His Ala Arg Glu Thr 

85 90 „ 

Ala Lys Thr Thr Val 

100 

(2) INFORMATION FOR SEQ ID NO: 2 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
ID) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 8: 

Met Glu lie Leu Thr Thr Gin Gin Val Arg Glu Gly Tyr Gin Ala He 

Leu Ala Ser Pro Leu Gly Leu Gin Asp Ala Phe Glu Val Ala Gin Glu 

20 25 30 

Lys Ser Gly Ser Tyr Thr Val Pro Pro Glu He Asp Gly Trp Lys Gly 

35 40 45 

Asn Thr Glu Pro Leu Arg He Asp Tyr Val Phe Thr Thr Lys Glu Leu 



50 



55 



60 



Ala Val Glu Asn Leu His Val Val Phe Asp Gly Asn Lys Ser Pro Gin 



65 70 75 

Val Ser Asp His Tyr Gly Leu Asn Ala Met Leu Asn Trp Lys 
85 90 

(2) INFORMATION FOR SEQ ID NO: 2 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



80 
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(ii) MOLECULE TYPE: protein 



PCT/US97/05306 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:239: 



Met Leu Lys His Leu Asn Leu Lys Gly His Leu Leu Thr Ala lie Ser 

15 10 15 

Tyr Met He Pro He Val Cys Gly Ala Gly Phe Leu Val Ala He Gly 

20 25 30 

Leu Ala Met Gly Gly Gly Val Pro Asp Ala Leu Val Ala Gly Lys Phe 

35. 40 45 

Thr He Trp Asp Ala Leu Ala Thr Met Gly Gly Lys Ala Leu Gly Leu 

50 55 60 

Leu Pro Val Val He Ala Thr Gly Leu Ser Tyr Ser He Ala Gly Lys 
65 70 75 80 

Pro Gly He Ala Pro Gly Phe Val Val Gly Leu He Ala Asn Ser Val 

85 90 . 95 

Gly Ser Gly Phe He Gly Gly He Leu Gly Gly Tyr He Val Gly Phe 

100 105 110 

Leu Val Gin Ala He He Lys Lys Val Lys Val Pro Asn Trp He Lys 

115 120 125 

Gly Leu Met Pro Thr Leu He He Pro Phe Val Pro Leu Trp 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO.: 240 : 



Met He Ala Leu 
1 

Ala Ala Leu Ala 
20 

Asp Phe Thr Lys 
35 

Leu Ala Val Ala 
50 



Gly Trp Ser Asn 
5 

Ser Val Ala Ala 

Thr Gly He Gly 
40 

Gly Leu Phe Leu 
55 



He Gly Ala Ala 

10 ■ 

Ala He He Met 
25 

Val Ala Gin Ala 

Thr Met He Val 
60 



He Ala Pro Asp 
15 

Val Leu Gly Gly 
30 

Val Ala He Pro 
45 

Arg Thr He Ser 
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Val Gly Leu Val His Thr Ala Asp Ala Ala Ala Lys Lys Gly Asp Phe 
65 70 75 so 

Gly Ala Val Glu Arg Ala His Phe lie Ala Leu Leu Phe Gin Gly Leu 

85 90 g 5 

Arg He Ala Leu Pro Ala Ala Leu Leu Leu Met Val Pro Thr Glu Thr 

100 105 110 

Val Gin Ser He Leu Ser Ala Met Pro Asp Trp Leu Lys Asp Gly Met 

115 120 125 

Ala He Gly Gly Gly Met Val Val Ala Val Gly Tyr Ala Met Val lie 

130 "5 140 

Asn Met Met Ala Thr Arg Glu Val Trp Pro Phe Phe Ala Leu Gly Phe 
145 150 155 160 

Val Leu Ala Ala Val Ser Asp lie Thr Leu lie Gly Phe Gly Ala He 

i65 170 175 

Gly Val Ala lie Ala Leu He Tyr Leu His Leu Ser Lys Thr Gly Gly 

180 185 190 

Asn Gly Gly Gly Gly Ala Ala Thr Ser Asn Asp Pro He Gly Asp He 

195 200 205 

Leu Glu Asp Tyr 

210 



(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE : amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

Met Leu Asp Val Glu Ala He Arg Lys Asp Phe Pro He Leu Asp Gin 

5 1° 15 

He Val Asn Asp Glu Pro Leu Val Tyr Leu Asp Asn Ala Ala Thr Thr 

20 25 30 

Gin Lys Pro Leu Val Val Leu Lys Ala He Asn Ser Tyr Tyr Glu Gin 

35 40 45 

Asp Asn Ala Asn Val His Arg Gly Val His Thr Leu Ala Glu Arg Ala 
50 55 



Thr Ala Ser Leu 
65 



197 



WO 97/37026 

(2) INFORMATION FOR SEQ ID NO: 242: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 

Met Thr Lys Leu Leu Asn Lys Lys Trp Lys Val Lys lie Met Lys Gin 

15 10 15 

lie Leu Leu Val Cys Asn Ala Gly Met Ser Thr Ser Met Leu Val Lys 

20 25 30 

Lys Met Gin Gin Ser Ala Thr Glu Arg Gly lie Glu lie Ser He Gin 

35 40 45 

Ala Lys Ser Met Thr Glu Ala Lys Lys Asn He His Glu Ala Asp Val 

50 55 60 

He Leu He Gly Pro Gin He Arg Tyr Glu. Leu Leu Ala Val Lys Glu 
65 70 75 80 

He Ala Gly Asn He Pro Val Asp Thr He Asp Met Arg Asp Tyr Gly 

85 90 95 

Met Met Asn Gly Ala Lys Val Leu Glu Gin Ala Leu Glu Trp He Gly 
100 105 110 

Glu He Arg 
115 



(2) INFORMATION FOR SEQ ID NO: 24 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:243: 



Met Val Leu Met Gin Lys Met His Gly Lys Glu Leu He Thr Leu Asn 
15 10 15 
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Gin Thr Val Lys Trp Tyr Lys Val 
20 

Lys Pro Trp Tyr Tyr Leu Lys Ser 

35 40 
Trp Gin Gly Asn Tyr Tyr Leu Lys 

50 55 
Glu Trp Val Tyr Asp Ala Thr Tyr 
65 70 
Asp Gly Ser Tyr Ala Tyr Ser Thr 
85 

He Gly Trp 



PCT/US97/0S3O6 

Ser Gly Phe Met lie Leu Leu Thr 
25 30 
Asp Gly Ser Tyr Ala Arg Asn Ala 
45 

Ser Asp Gly Lys Met Ala Val Asn 
60 

Gin Ala Trp Tyr Tyr Leu Thr Ser 
7 5 80. 
Trp Gin Gly Asn Tyr Tyr Pro Lys 
90 95 



(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

Met Leu Thr Val His Gly Lys Glu lie Thr lie Leu Lys Ser Asp Gly 

5 10 15 

Lys Met Ala Val Asn Glu Trp Val Asp Gly Gly Arg Tyr Tyr Val Gly 

20 25 30 

Ala Asp Gly Val Trp Lys Glu Gly Gin Ala Ser Thr Ala Ser Pro Ser 

35 4 0 45 

Asn Asp Ser Asn Ser Glu Tyr Ser Cys Cys Phe Arg Lys Gly Lys Lys 



60 

Leu 
65 



(2) INFORMATION FOR SEQ ID NO: 24 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE: protein 
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(XX) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 

Met Asn Asn Asn Phe Asn Asn Phe Asn Asn Met Asp Asp Leu Phe Asn 

15 10 15 

Gin Leu Met Gly Gly Met Arg Gly Tyr Ser Ser Glu Asn Arg Arg Tyr 

20 25 30 

Leu lie Asn Gly Arg Glu Val Thr Pro Glu Glu Phe Ala His Tyr Arg 

35 . 40 45 

Thr Thr Gly Gin Leu Pro Gly Asn Ala Glu Thr Asp Val Gin Met Pro 

50 55 60 

Gin Gin Ala Ser Gly Met Lys Gin Asp Gly Val Leu Ala Lys Leu Gly 
65 70 75 80 

Arg Asn Leu Thr Ala Glu Ala Arg Glu Gly Lys Leu Asp Pro Val lie 

85 90 95 

Gly Arg Asn Lys Glu lie Gin Glu Thr Ser Glu lie Leu Ser Arg Arg 

100 105 110 

Thr Lys Asn Asn Pro Val Leu Val Gly Asp Ala Gly Val Gly Lys Thr 

115 120 125 

Ala Val Val Glu Gly Leu Ala Gin Ala lie Val Asn Gly Asp Val Pro 

130 135 140 

Ala Ala lie Lys Asn Lys Glu lie lie Ser lie Asp lie Ser Gly Leu 
145 150 155 160 

Glu Ala Gly Thr Gin Tyr Arg Gly Ser Phe Glu Glu Asn Val Gin Asn 

165 170 175 

Leu Val Asn Glu Val Lys Glu Ala Gly Asn lie He Leu Phe Phe Asp 

180 185 190 

Glu He His Gin He Leu Gly Ala Gly Ser Thr Cys Gly Asp Ser Gly 

195 200 205 

Ser Lys Gly Leu Ala Asp He Leu Ser Gin Ser He Ser Leu Val Glu 
210 215 220 

Asn 



225 



(2) INFORMATION FOR SEQ ID NO: 246 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids 

(B) TYPE: amino, acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 

Met Thr He Phe Pro Asp Asp Phe Leu Trp Gly Gly Ala Val Ala Ala 

1 5 10 15 

Asn Gin Val Glu Gly Ala Tyr Asn Glu Asp Gly Lys Gly Leu Ser Val 

20 25 * 3 0 

Gin Asp Val Leu Pro Lys Gly Gly Leu Gly Glu Ala Thr Glu Asn Pro 

35 40 45 

Thr Glu Asp Asn Leu Lys Leu He Gly He Asp Phe Tyr His Lys Tyr 

50 55 60 

Lys Glu Asp He Ser Leu Phe Ser Glu Met Gly Phe Asn Val Phe Arg 
65 70 75 80 

Thr Ser He Ala Trp Ser Arg He Phe Pro Lys Gly Asp Glu Glu Glu 

95 90 95 

Pro Asn Glu Ala Gly Leu Lys Tyr Tyr Asp Glu Leu Phe Asp Glu Leu 

100 105 no 

His Ala His Gly He Glu Pro Leu Val Thr Leu Ser His Tyr Glu Thr 

115 120 12 5 

Pro Leu Tyr Leu Ala Arg Lys Tyr His Gly Trp He Asp Arg Lys Asn 

130 135 140 

Asp Ser Phe Leu 
145 

(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 

Met He Glu Tyr Lys Asn Val Ala Leu Arg Tyr Thr Glu Lys Asp Val 

5 10 15 

Leu Arg Asp Val Asn Leu Gin He Glu Asp Gly Glu Phe Met Val Leu 

20 25 30 

Val Gly Pro Ser Gly Ser Gly Lys Thr Thr Met Leu Lys Met He Asn 

35 40 45 

Arg Leu Leu Glu Pro Thr Asp Gly Asn He Tyr Met Asp Gly Lys Arg 
50 55 60 
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lie Lys Asp Tyr Asp Glu Arg Glu Leu Arg Leu Ser Thr Gly Tyr Val 
65 70 75 80 

Leu Gin Ala lie Ala Leu Phe Pro Asn Leu Thr Val Ala Glu Asn lie 

85 90 95 

Ala Leu lie Pro Glu Met Lys Gly Trp Ser Lys Glu Glu lie Thr Lys 

100 105 110 

Lys Thr Glu Glu Leu Leu Ala Lys Val Gly Leu Pro Val Ala Glu Tyr 

115 120 125 

Gly His Arg Leu Pro Ser Glu Leu Ser Gly Gly Glu Gin Gin Arg Val 

130 135 140 

Gly lie Val Arg Ala Met lie Gly Gin Pro Lys lie Phe Leu Met Asp 
145 150 155 160 

Glu Pro Phe Ser Ala Leu Asp Ala lie Ser Arg Lys Gin Leu Gin Val 

165 170 175 

Leu Thr Lys Glu Leu His Lys Glu Phe Gly Met Thr Thr lie Phe Val 

180 185 190 

Thr His Asp Thr Asp Glu Ala Leu Lys Leu Ala Asp Arg lie Ala Val 

195 200 205 

Leu Gin Asp Gly Glu lie Arg Gin Val Ala Asn Pro Glu Thr lie Leu 

210 215 220 

Lys Val Pro Ala Thr Asp Phe Val Ala Asp Leu Phe Gly Gly Ser Val 
225 230 235 240 

His Asp 



(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:248: 

Met. Pro Phe Val Pro Gly lie Ala Leu Thr Asn Ala Val Arg Asp lie 

15 10 15 

Met Thr Asn His lie Asn Ser Gly Met Ser Lys Met Phe Glu Ser Leu 

20 25 30 

Leu lie Thr Leu Ala Leu Gly Ala Gly Thr Ser Val Ala Leu Val Leu 
35 40 45 
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50 
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(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) . TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

Met Tyr Thr He Arg Met Leu Lys Met Gly Ser Glu Ala Ala Ala Lys 

1 5 10 15 

Ser Ala Gin Glu His Gly Leu Lys Ser Val Glu Val Thr Val Lys Gly 

20 25 30 

Pro Gly Ser Gly Arg Glu Ser Ala He Ser Cys Ala Cys Cys Arg Trp 
35 40 45 

Ser 



(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

Met lie Glu Phe Glu Lys Pro Asn lie Thr Lys He Asp Glu Asn Lys 

Asp Tyr Gly Lys Leu Val He Glu Pro Leu Glu Arg Gly Tyr Gly Thr 

20 25 30 

Ala Leu Gly Asn Ser Leu Arg Arg Val Leu Leu Ala Ser Leu Pro Gly 



35 40 



Ala Ala Val Thr Ser He Asn He Asp Gly Val Leu nls Glu Phe Asp 
50 55 60 
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Thr Val Pro Gly Val Arg Glu Asp Val Met Gin lie lie Leu Asn lie 
65 70 75 80 

Lys Gly lie Ala Val Lys Ser Tyr Val Glu Asp Glu Lys lie lie Glu 

85 90 95 

Leu Asp Val Glu Gly Pro Ala Glu Val Thr Ala Gly Asp lie Leu Thr 

100 105 110 

Asp Ser Asp lie Glu lie Val Asn Pro Asp His Tyr Leu Phe Thr He 

115 120 125 

Gly Glu Gly Ser Ser Leu Lys Ala Thr Met Thr Val Asn Ser Gly Arg 

130 135 140 

Gly Tyr Val Pro Ala Asp Glu Asn Lys Lys Asp Asn Ala Pro Val Gly 
145 . 150 155 160 

Thr Leu Ala Val Asp Ser He Tyr Thr Pro Val Thr Lys Val Asn Tyr 

165 170 175 

Gin Val Glu Pro Ala Arg Val Gly Ser Asn Asp Gly Phe Asp Lys Leu 

180 185 190 

Thr Leu Glu lie Leu Thr Asn Gly Thr He He Pro Glu Asp Ala Leu 

195 200 205 

Gly Leu Ser Ala Arg lie Leu Thr Glu His Leu Asp Leu Phe Thr Asn 

210 215 220 

Leu Thr Glu He Ala .Lys Ser Thr Glu Val Met Lys Glu Ala Asp Thr 
225 230 235 240 

Glu Ser Asp Asp Arg He Leu Asp Arg Thr He Glu Glu Leu Asp Leu 

245 250 255 

Ser Val Arg Ser Tyr Asn Cys Leu Lys Arg Ala Gly He Asn Thr Val 

260 265 270 

His Asp Leu Thr Glu Lys Ser Glu Ala Glu Met Met Lys Val Arg Asn 

275 280 285 

Leu Gly Arg Lys Ser Leu Glu Glu Val Lys Leu Lys Leu He Asp Leu 

290 295 300 

Gly Leu Gly Leu Lys Asp Lys 
305 310 



(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 
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Met Asp He Glu Lys Gin Arg Gly He Ser Val Thr Ser Ser Val Met 

1 5 10 15 

Gin Phe Asp Tyr Asp Gly Lys Arg Val Asn He Leu Asp Thr Pro Gly 

20 25 30 

His Glu Asp Phe Ser Glu Asp Thr Tyr Arg Thr Leu Met Ala Val Asp 

35 40 45 

Ala Ala Val Met Val Val Asp Ser Ala Lys Gly He Glu Ala Gin Thr 

50 55 60 

Lys Lys Leu Phe Glu Val Val Lys His Arg Gly He Pro Val Phe Thr 
" 70 75 eo 

Phe Met Asn Lys Leu Asp Arg Asp Gly Arg Glu Pro Leu Asp Leu Leu 

85 90 95 

Gin Glu Leu Glu Glu He Leu Gly He Ala Ser Tyr Pro Met Asn Trp 

ioo 105 110 

Pro lie Gly Met Gly Lys Ala Phe Glu Gly Leu Tyr Asp Leu Tyr Asn 

115 120 125 

Gin Arg Leu Glu Leu Tyr Lys Gly Asp Glu Arg Phe Ala Ser Pro Arg 

130 135 140 

Arg Trp Arg Gin Thr Phe Trp 
1*5 150 

(2) INFORMATION FOR SEQ ID NO : 2 52: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 

Met Ala Val He Ala Glu Arg Lys Ala Tyr Tyr His Asp Pro Val Glu 

1 5 10 i5 

Asp Ala He He Met. Lys Arg Glu He Asp Glu Gly 



20 25 
(2) INFORMATION FOR SEQ ID NO: 2 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 
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(C) 
(D) 



STRANDEDNESS: single 
TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:253: 

Met Asn Glu Thr Trp Ser Arg Thr Ala Leu Thr Leu Leu Lys His Gly 

15 10 15 

Ser Glu Val Asn Leu Glu Arg Ala Leu Ser Val Asn Gly Arg Leu Gly 

20 25 30 

Gly His Val Val Thr Gly His lie Asp Gly Thr Gly Lys lie Ser Ser 

35 40 45 

lie Lys Lys Asp Asp Asn Ala Val Trp Tyr Gin lie Asn Thr Gin Lys 

50 55 60 

Glu lie Leu Asp Leu lie Val Glu Lys Gly Ser lie Thr lie Asp Gly 
65 70 75 80 

lie Ser Leu Thr Val Ala Lys Val Ser Lys Val Asn Phe Ser Val Ser 

85 90 95 

Val lie Pro His Thr Leu Lys Gin Thr lie Leu Lys Ser Lys Gin Val 

100 105 110 

Gly Ser Thr Val Asn Leu Glu Asn Asp lie Leu Gly Lys Tyr Val Gin 

115 120 125 

Lys Leu Met Asp Asn Ser Pro Lys Ser Glu lie Ser Lys Glu Leu Leu 

130 135 140 

Tyr Gin Asn Gly Phe 
145 



<2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TO POLOGY : 1 inear 

<ii> MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 



Met Asn Trp Ser Ala Leu Glu Gin Gin lie Val Asp Glu Leu Lys lie 

15 10 15 

Tyr lie Ala Pro Lys lie Phe Gly Gly Ser Ala Lys Phe Pro Val Gly 

20 25 3 0 
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Gly Glu Gly He Ser Leu Pro Asn Asp Ala He Arg Leu Lys Pro Tyr 

35 40 45 

Ala Phe Ser Gin Xaa Gly Xaa Asp Tyr Leu He Glu Ser Glu Val He 

50 55 60 

Tyr Pro Cys Ser Gin Glu 
65 70 

(2) INFORMATION FOR SEQ ID NO: 25 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii> MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

Met: Val Leu Phe Leu Val Pro Gin Leu Arg Asn Ala Tyr Gly Thr Ala 

1 5 10 15 

Ala lie Gly He lie Cys Gly Leu Tyr Trp Ala Val Ser Ser Asn Met 

20 25 30 

Thr Val Glu Ala Thr Gin Arg Leu Thr Gly Gly Gly Gly Phe Ala He 

35 40 45 

Gly His Gin Gin Gin Phe Ala He Trp Phe Val Asp Lys Val Ala Gly 
50 55 60 



Arg Phe Gly Lys Lys Glu Glu Ser Leu Asp Asn Leu Lys Leu Pro Lys 
" 70 75 80 

Phe Leu Ser He Phe His Asp Thr Val Val Ala Ser Ala Thr Phe Asp 



85 90 



Ala Arg He Leu Arg Gly His Ser Phe Asn Leu Gly Ser Arg hIs Tyr 
100 105 110 

Val 



(2) INFORMATION FOR SEQ ID NO: 25 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE 



TYPE : protein 



PCT/US97/05306 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:256: 

Met Leu lie Asn Ser Leu Asn Gin Ser Tyr lie Ser Ala Lys Arg lie 

15 10 15 

Glu Glu Val Phe Ala Glu Ser Pro Glu Asn lie His Ser Glu Leu Glu 

20 25 30 

Gin Lys Gin Val Thr Ser Gly Arg Val Leu Gin Val Gin Glu Leu Thr 

35 40 45 

Phe Thr Tyr Pro Asp Ala Ala Gin Pro Ser Leu Arg Asp lie Ser Phe 

50 55 60 

Asp Met Thr Gin Gly Gin lie Leu Gly lie lie Gly Gly Thr Gly Ser 
65 70 75 80 

Gly Lys Ser Ser Leu Val Gin Leu Leu Leu Gly Leu Tyr Pro Val Asp 

85 90 . 95 

Lys Gly Asn lie Asp Leu Tyr Gin Asn Gly Arg Ser Pro Leu Asn Leu 

100 105 110 

Glu Gin Trp Arg Ser Trp lie Ala Tyr Val Pro Gin Lys Val Lys Leu 

115 120 125 

Phe Lys Gly Thr lie Arg Ser Asn Leu Thr Leu Gly Leu Asn Gin Glu 

130 135 140 

Val Ser Asp Gin Lys Leu Trp Gin Ala Leu Glu lie Ala Gin Ala Lys 
145 150 155 160 

Asp Phe Val Ser Glu Lys Glu Gly Leu Leu Asp Ala Leu lie Glu Ala 

165 170 175 

Gly Gly Arg Asn Phe Ser Gly Gly Gin Lys Gin Arg Leu Ser lie Ala 

180 185 190 

Arg Ala Val Leu Arg Gin Ala Pro Phe lie lie Leu Asp Asp Ala Thr 

195 200 205 

Ser Ala Leu Asp Thr lie Thr Glu Ser Lys Leu Leu Lys Ala lie Arg 

210 215 .220 

Glu Asn Phe Pro Asn Thr Ser Leu lie Leu lie Ser Gin Arg Thr Ser 
225 230 235 240 

Thr Leu Gin Met Ala Asp Gin lie Leu Leu Leu Glu Lys Gly Glu Leu 

245 250 255 

Leu Ala Val Gly Lys His Asp Asp Leu Met Lys Ser Ser Gin Val Tyr 



260 



265 



270 



Arg Glu He 
275 



Asn 



Ala Ser 



Gin 



His Gly 
280 



Lys 



Glu 



Asp 



(2) INFORMATION FOR SEQ ID 



NO: 2 57: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

Met Lys Asp Pro Glu Ser Arg Leu Tyr He Leu Leu Lys Asp Gly Gin 

1 5 10 15 

Val He Gly Thr Cys Thr Val Asp Leu Ser Thr Asn Thr Asn Tyr Phe 

20 " 30 

Tyr Gly Leu Ala He Leu Glu Pro Glu Arg Gly Lys Gly Tyr Gly Ser 

35 40 45 

Tyr Leu Ala Lys Ser Leu Val Asn Gin Leu He Glu Gin Asn Asp Lys 

50 55 60 

Glu Phe Gin lie Ala Val Glu Asp Ser Asn Val Gly Ala Lys Arg Leu 



65 

80 



70 75 



Tyr Glu Lys He Gly Phe Val Lys Gin Thr Gin Val Val Tyr Leu Asn 

85 90 95 

Glu Lys Gly Ala Arg Asp Ser Glu Val 

100 105 

(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 

Met Phe Ala Leu Ser Ser Asn Lys Glu Leu Ala Glu Arg Val Ala Gin 

Glu lie Gly He Glu Leu Gly Lys Ser llr Val Arg Gin Phe Ser Asp 

20 25 
Gly Glu lie Gin val Asn He Glu Glu Ser He Arg Gly Lys His Val 

Phe lie Leu Gin Ser Thr Ser Ser Pro Val Asn Asp 1L Leu Leu Glu 



55 60 
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lie Leu lie Met Val Asp Ala Leu Lys Arg Ala Ser Ala Glu Ser Val 
65 70 75 80 

Asn Val Val Met Pro Tyr Tyr Gly Tyr Ala Arg Gin Asp Arg Lys Ala 

85 90 95 

Arg Ala Arg Glu Pro lie Thr Ser Lys Leu Val Ala Asn Met Leu Glu 

100 105 110 

Val Ala Gly Val Asp Arg Leu Leu Thr lie Asp Leu His Ala Ala Gin 

115 120 125 

lie Gin Gly Phe Phe Asp lie Pro Val Asp His Leu Met Gly Ala Pro 

130 135 140 

Leu Xle Ala Asp Tyr Phe Glu Arg Pro Gly Met Val Gly Ser Asp Tyr 
145 150 155 160 

Val Val Val Ser Pro Asp His Gly Gly Val Thr Arg Ala Arg Lys Leu 

165 170 175 

Ala Glu Phe Leu Lys Thr Ser lie Ala lie lie Glu Lys Arg Arg Ser 

180 185 190 

Val Asp Lys Met Asn Thr Ser Glu Val Met Asn Thr He Gly Lys Val 

195 200 205 

Glu Gly Asn His Leu 
210 

(2) INFORMATION FOR SEQ ID NO: 2 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) 'LENGTH: 182 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

Met He Ser Phe Leu Leu Leu Leu Val Leu Val Trp Gly Phe Tyr He 

1 5 10 15 

Gly Tyr Arg Arg Gly Leu Leu Leu Gin Val Tyr Tyr Leu He Ser Ala 

20 25 30 

Met Ala Ser Ala Phe Met Ala Gly Gin Phe Tyr Lys Gly Leu Gly Glu 

35 40 45 

Gin Phe His Leu Leu Leu Pro Tyr Ala Asn Ser Gin Glu Gly Gin Gly 

50 55 60 

Thr Phe Phe Phe Pro Ser Asp Gin Leu Phe Gin Leu Asp Lys Val Phe 
65 70 75 80 
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Tyr Ala Gly lie Gly Tyr Leu Leu 
85 

Gly Arg Leu Leu Gly Leu Leu Leu 
100 

Gly Gly Lys Leu Phe Gin Val Ser 
115 120 
Thr Leu Phe Val Leu Gin Met Ala 

130 135 
Met Ala Val lie Gin Asn Pro Leu 
145 150 
lie He Gin Ser lie Pro He Thr 
165 

Val Thr Asn Leu lie Gly 
180 
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Val Phe Gly He Val Tyr Ser He 

90 95 
His Leu He Pro Ser Lys Lys Leu 
105 HO 
Ala Gly He Leu Ser Met Leu Val 
125 

Leu Thr He Leu Ala Thr He Pro 
140 

Glu Lys Ser He Val Ala Lys His 
155 i 6 o 
Thr Ser Trp Leu Lys Gin He Trp 
170 175 



(2) INFORMATION FOR SEQ ID NO: 2 60: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 60: 

Met Arg Ala Phe Ser Asp Phe Gly Thr Pro Met Leu He Gly Glu Gly 

1 5 10 15 

Tyr Arg Thr Phe Pro Val Leu He Tyr Thr Gin Phe He Ser Glu Val 

20 25 30 

Gly Gly Asn Ser Ala Phe Ala He Met Ala He He He Ala Leu Ala 

35 40 45 

He Phe Leu He Gin Lys His He Ala Asn Arg Tyr Ser Phe Ser Met 

50 55 60 

Asn Leu Leu His Pro He Glu Pro Lys Lys Thr Thr Lys Gly Lys Met 
65 70 75 ao 

Ala Ala He Tyr Ala Thr Val Tyr Gly He He Phe He Ser Val Leu 

85 90 95 

Pro Gin He Tyr Leu He Tyr Thr Ser Phe Leu Lys Thr Ser Gly Met 

100 105 110 

Val Phe Val Lys Gly Tyr Ser Pro Asn Ser Tyr Lys Val Ala Phe Asn 
115 120 125 
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Arg Met Gly Ser Ala He Phe Asn Thr He Arg He Pro Leu He Ala 

130 135 140 

Leu Val Leu Val Val Pro He Tyr Asp He Tyr Leu Leu Pro Ser Arg 
145 150 155 160 

(2) INFORMATION FOR SEQ ID NO: 2 61: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 

Met Val Pro Tyr lie Val Pro Gly Thr Val Leu Gly He Ala Phe He 

1 5 10 15 

Ser Ser Phe Asn Thr Gly Leu Phe Gly Ser Gly Phe Leu Met He Thr 

20 25 30 

Gly Thr Ala Phe He Leu He Met Ser Leu Ser Val Arg Arg Leu Pro 

35 40 45 

Tyr Thr He Arg Ser Ser Val Ala Ser Leu Gin Gin He Ala Pro Ser 

50 55 60 

He Glu Glu Ala Ala Gly Lys Leu Arg Lys 
65 70 

(2) INFORMATION FOR SEQ ID NO: 2 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 62: 

Met He Phe He Pro Met Ala Ala Tyr Ser He Ala Arg Asn Met Ser 

15 10 15 

Lys Arg Lys Ala Phe Thr He His Val Tyr Pro Leu Asn Pro Arg Asn 
20 25 30 
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Leu Arg Thr Phe Pro Ser His His Asp Ser Asp Tyr Gly Tyr Asp Glu 

35 40 45 

(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

Met Lys Phe Arg Lys Leu Ala Cys Thr Val Leu Ala Gly Ala Ala Val 

1 5 " X5 

Leu Gly Leu Ala Ala Cys Gly Asn Ser Gly Gly Ser Lys Asp Ala Ala 

20 25 30 

Lys Ser Gly Gly Asp Gly Ala Lys Thr Glu He Thr Trp Trp Ala Phe 



35 " 45 



Pro Val Phe Thr Gin Glu Lys Thr Gly Asp Gly Val Gly Thr Tyr Glu 

50 55 60 

Lys Ser He He Gin Ala Phe 
65 70 

(2) INFORMATION FOR SEQ ID NO: 2 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 275 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 

Met Asn Ser Gin Gin Lys Pro Gin Glu He Lys Val Glu Glu Pro Val 

5 10 15 

Glu ser Lys Glu Glu Thr Val Asn Gin Pro Val Glu Gin Pro Lys Val 

25 20 
Glu Thr Pro Ala Val Glu Lys Gin Thr Glu Pro Thr Glu Glu Pro Lys 



35 
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Val Glu Val Thr Ser lie Pro Gin Thr Thr Arg Tyr Glu Glu Asp Leu 

50 55 - 60 

Thr Lys Glu His Gly Thr Arg Glu Val Val Lys Glu Gly Lys Asn Gly 
65 70 75 80 

Ser Arg Thr Val Thr Thr Pro Tyr lie Leu Asn Ala Thr Asp Gly Thr 

85 90 95 

Thr Thr Glu Gly Thr Ser Thr Thr Asp Glu Ala Glu Met Glu Lys Glu . 

100 105 110 

Val Val Arg Val Gly Thr Lys Pro Lys Glu Lys Leu Ala Pro Val Leu 

115 120 125 

Ser Leu Thr Ser Val Thr Asp Asn Ala Met Leu Arg Ser Ala Arg Leu 

130 135 140 

Thr Tyr His Leu Glu Asn Thr Asp Ser Val Asp Val Lys Lys lie His 
145 150 155 160 

Ala Glu lie Lys Asn Gly Asp Lys Val Val Lys Thr lie Asp Leu Ser 

165 170 . 175 

Lys Glu Arg Leu Ser Asp Ala Val Asp Gly Leu Glu Leu Tyr Lys Asp 

180 185 190 

Tyr Lys He Val Thr Ser Met Thr Tyr Asp Arg Gly Asn Gly Glu Glu 

195 200 205 

Thr Ser Thr Leu Glu Glu Thr Pro Leu Arg Leu Asp Leu Lys Lys Val 

210 215 220 

Glu Leu Lys Asn He Gly Ser Thr Asn Leu Val Lys Val Asn Glu Asp 
225 230 235 240 

Gly Thr Glu Val Ala Ser Asp Phe Leu Thr Ser Lys Pro Val Asp Val 

245 250' 255 

Gin Asn Tyr Tyr Leu Lys Val Thr Ser Arg Asp Asn Lys Val Val Ser 
260 265 270 

Pro Pro Ser 
275 

(2) INFORMATION FOR SEQ ID NO: 2 65: 



{i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ing 1 e 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION : SEQ . ID NO: 26 5: 
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Met Ala Val Met Lys rie Glu Tyr Tyr Ser Gin Val Leu Asp Met Glu 

Trp Gly val Asn Val Leu Tyr Pro Asp Ala Asn Arg Val Glu Glu Pro 

20 25 30 

Glu Cys Glu Asp He Pro Val Leu Tyr Leu Leu His Gly Met Ser Gly 



* u 45 
Asn His Asn Ser Trp Leu Lys Arg Thr Asn Val Glu Arg Leu Leu Arg 

50 5 5 60 

Gly Thr Asn Leu lie Val Val Met Pro Asn Thr Ser Asn Gly Trp Tyr 
65 70 75 

Thr Asp Thr Gin Tyr Gly Phe Asp Tyr Tyr Thr Ala Leu Ala Glu Glu 

85 90 95 

Leu Pro Gin Val Pro Glu Thr Leu Leu Pro 

100 105 

(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 66: 

Met Arg Leu Leu Ala Met Lys Met Lys Gin He Ser Asp Thr Thr Leu 

Lys lie Thr Met Ser Leu Glu Asp Leu Met Asp Arg Gly Met Glu He 

20 25 
Ala Asp Phe Leu Val Pro Gin Glu Lys Thr Glu Glu Phe Phe Tyr Ala 

35 dC\ 



He Leu Gly 
50 



(2) INFORMATION FOR SEQ ID NO: 2 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 

Met Leu Pro lie He Cys Cys Gly Glu Ser Leu Glu Thr Tyr Glu Ala 

1*5 10 15 

Gly Lys Ala Ala Glu Phe Val Gly Ala Gin Val Ser Ala Ala Leu Ala 

20 25 30 

Gly Leu Thr Ala Glu Gin Val Ala Ala Ser Val He Ala Tyr Glu Pro 

35 40 45 

He Trp Ala He Gly Thr Gly Lys Ser Ala Ser Gin Asp Asp Ala Gin 

50 55 60 

Lys Met Cys Lys Val Val Arg Asp Val Val Ala Ala Asp Phe Gly Gin 
65 70 75 80 

Glu Val Ala Asp Lys Val Arg Val Gin Tyr Gly Gly Ser Val Lys Pro 

85 90 95 

Glu Asn Val Ala Ser Tyr Met Ala Cys Pro Asp Val Asp Gly Ala Leu 

100 105 110 

Val Gly Gly Ala Ser Leu Glu Ala Glu Ser Phe Leu Ala Leu Leu Asp 

115 120 125 

Phe Val Lys 
130 



(2)« INFORMATION FOR SEQ ID NO:268: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 68: 



Met Phe Val Glu 
1 

Val Pro Pro Tyr 
20 

He Gly Val Gly 
35 

Phe Thr He Phe 
50 



Ala Cys Lys Ala 
5 

Gly He Gly Gly 

Asp He He Gly 
40 

Ala Met Pro Val 
55 



Val Val Arg Ala 
10 

Thr Leu Tyr Leu 
25 

Val Lys Pro Ala 

Gly Asn Tyr Phe 
60 



Asn Glu Glu Tyr 
15 

Arg Pro Leu Leu 
30 

Glu Glu Tyr lie 
45 

Lys Gly Gly Leu 
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Val Pro Thr Asn Phe Leu lie Gin Asp Glu Tyr Asp Arg Ala Ala Pro 



65 70 75 



80 



Asn Gly Thr Gly Ala Ala Lys Val Gly Gly Asn Tyr Ala Ala Ser Leu 

85 90 95 

Leu Pro Gly Lys Met Ala Lys Ser Arg His Phe Ser Asp Val He Tyr 

100 110 
Leu Asp Pro Ser Thr His Thr Lys lie Glu Glu Val Gly Ser Ala Asn 

115 120 125 

Phe Phe Gly He Thr Ala Asp Asn Glu Phe Val Thr Pro Leu Ser Pro 

130 135 140 

Ser He Leu Pro Ser lie Thr Lys Tyr Ser Leu Leu Tyr Leu Ala Glu 

150 155 160 

Hrs Arg Leu Gly Leu Thr Pro He Glu Gly Asp Val Pro He Asp Asn 

165 170 175 

Leu Asp Arg Phe Val Lys Ala Gly Ala Cys Gly Thr Ala Ala Val He 

180 185 
Ser Pro lie Gly Gly He Gin His Gly Asp Asp Phe His HI xie Leu 
195 200 2 05 

(2) INFORMATION FOR SEQ ID NO : 2 6 9 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 

Met val Met He Ser Met Leu Phe Tyr Ser Glu Thr Glu Val Gly P ro 

5 10 15 

Val Thr Arg Lys Leu Tyr Asn Glu Leu Thr Gly He Gin Phe Gly Asp 
20 



25 



He Glu Ala Pro Glu Gly Trp He Val Lys Val Asp 



30 



35 40 



(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 248 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 

Met Thr Ala Ser Pro Leu Lys Lys Ser lie Lys Lys Lys Asn Arg Lys 

1 5 10 15 

Leu Thr Asn Lys Xaa Glu Lys His Met Ser Lys lie lie Gly He Asp 

20 25 30 

Leu Gly Thr Thr Asn Ser Ala Val Ala Val Leu Glu Gly Thr Glu Ser 

35 40 45 

Lys lie He Ala Asn Pro Glu Gly Asn Arg Thr Thr Pro Ser Val Val 

50 55 60 

Ser Phe Lys Asn Gly Glu He He Val Gly Asp Ala Ala Lys Arg Gin 
65 70 75 80 

Ala Val Thr Asn Pro Asp Thr Val He Ser lie Lys Ser Lys Met Gly 

85 90 95 

Thr Ser Glu Lys Val Ser Ala Asn Gly Lys Glu Tyr Thr Pro Gin Glu 

100 105 110 

He Ser Ala Met He Leu Gin Tyr Leu Lys Gly Tyr Ala Glu Asp Tyr 

115 120 125 

Leu Gly Glu Lys Val Thr Lys Ala Val He Thr Val Pro Ala Tyr Phe 

130 135 140 

Asn Asp Ala Gin Arg Gin Ala Thr Lys Asp Ala Gly Lys He Ala Gly 
145 150 * 155 160 

Leu Glu Val Xaa Arg He Val Asn Glu Pro Thr Ala Ala Ala Leu Ala 

165 170 175 

Tyr Gly Leu Asp Lys Thr Asp Lys Glu Glu Lys He Leu Val Phe Asp 

180 185 190 

Leu Gly Gly Gly Thr Phe Asp Val Ser He Leu Glu Leu Gly Asp Gly 

195 200 205 

Val Phe Asp Val Leu Ser Thr Ala Gly Asp Asn Lys Leu Gly Gly Asp 

210 215 220 

Asp Phe Asp Gin Lys He He Asp His Leu Val Ala Glu Phe Lys Lys 
225 230 235 240 

Glu Asn Gly He Arg Leu Val Tyr 
245 

(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: . 
(A) LENGTH: 109 amino acids 
<B) TYPE: amino acid 
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<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:271: 

Met Lys Leu He Val Asp Leu He Tyr Glu Gly Gly Phe Lys Lys Met 

Arg Gin Ser He Ser Asn Thr Ala Glu Tyr Gly Asp Tyr Val Ser Gly 

20 25 30 

Pro Arg Val He Thr Glu Gin Val Lys Glu Asn Met Lys Ala Val Leu 

35 40 45 

Ala Asp He Gin Asn Gly Lys Phe Ala Asn Asp Phe Val Asn Asp Tyr 

50 55 60 

Lys Ala Gly Arg Pro Lys Leu Thr Ala Tyr Arg Glu Gin Ala Ala Asn 
65 70 75 80 

Leu Glu He Glu Lys Val Gly Ala Glu Leu Arg Lys Ala Met Pro Phe 

95 



85 90 



Val Gly Lys Asn Asp Asp Asp Ala Phe Lys He Tyr Asn 
100 105 

(2) INFORMATION FOR SEQ ID NO: 272: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 
Met Leu Leu Ser Ala Phe His Lys Tyr Glu Asn Gin Leu Asp Lys Val 

Arg Phe He Gly Leu His Thr Gly His Leu Gly Phe Tyr Thr Asp Tyr 

^ ^ ^ ^ 30 

Arg Asp Phe Glu Leu Asp Lys Leu Val Thr Asn Leu Gin Leu Asp Thr 

" 40 45 

Gly Ala Arg Val Ser Tyr Pro Val Leu Asn Val Lys Val Phe Leu Glu 
50 55 60 

Asn Gly Glu Val Lys He Phe Arg Ala Leu Asn Glu Ala Ser He Ara 

65 70 7S 

75 80 

219 



WO 97/37026 PCTOJS97/05306 
Arg Ser Asp Arg Thr Met Val Ala Asp lie Val lie Asn Gly Val Pro 

85 90 95 

Phe Glu Arg Phe Arg Gly Asp Gly Leu Thr Val Ser Thr Pro Thr Gly 

100 105 110 

Ser Thr Ala Tyr Asn 
115 

(2) INFORMATION FOR SEQ ID NO:273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 2 73: 

Met Gly Ala Gin Leu Ala Arg Glu Phe Lys His Glu Ala Asp lie Val 

1 5-10 15 

Val Gly Val Pro Asn Ser Ser Leu Ser Ala Ala Met Gly Phe Ala Glu 

20 . 25 30 

Glu Ser Gly Leu Pro Asn Glu Met Gly Leu lie Lys Asn Gin Tyr Thr 

35 40 45 

Gin Arg Thr Phe He Gin Pro Thr Gin Glu Leu Arg Glu Gin Gly Val 

50 55 60 

Arg Met Lys Leu Ser Ala Val Ser Gly Val Val Lys Gly Lys Arg Val 
65 70 75 80 

Val Met Val Asp Asp Ser lie Val Arg Gly Thr Thr Ser Arg Arg lie 

85 90 95 

Val Gin Leu Leu Lys Glu Ala Gly Ala Thr Glu Val His Val Ala lie 

100 105 110 

Gly Ser Pro Ala Leu Ala Tyr Pro Cys Phe Tyr Gly He Asp He Gin 

115 120 125 

Thr Arg Gin Glu Leu He Ala Ala Asn His Thr Val Glu Glu Thr Arg 

130 135 140 

Gin He He Gly Ala Asp Ser Leu Thr Tyr Leu Ser He Asp Ser Leu 
145 150 155 160 

He Glu Ser He Gly He Glu Thr Asp Ala Pro Asn Gly Gly Leu Cys 

165 170 175 

Val Ala Tyr Phe Asp Gly Asp Tyr Pro Thr Pro Leu Tyr Asp Tyr Glu 
180 185 190 
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Glu Asp Tyr Arg Arg Ser Leu Glu Glu Lys Thr Ser Phe Tyr Lys 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 274: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 215 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Jci) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 
Met Lys lie Leu Val Thr Gly Phe Asn Pro Phe Gly Gly Glu Lys He 

1 5 io 15 

Asn Pro Ala Leu Glu Ala Val Lys Leu Leu Pro Ser Glu He Asn Gly 

20 25 30 

Ala Glu val Arg Trp Val Glu He Pro Thr Val Phe Tyr Lys Ser Ser 

35 40 45 

Glu Val Leu Glu Ala Glu He Leu Arg Tyr Gin Pro Asp Ala Val Leu 

50 55 60 

Cys He Gly Gin Ala Gly Gly Arg Thr Gly Leu Thr Pro Glu Arg Val 
" 70 75 80 

Ala He Asn Gin Asp Asp Ala Arg He Pro Asp Asn Glu Gly Asn Gin 

85 90 95 

Pro He Asp Thr Pro He Arg He Asp Gly Ala Ser Ala Tyr Phe Ser 

100 105 110 

Ser Leu Pro He Lys Ala Met: Val Gin Ala Asn Lys Lys Glu Gly Leu 

115 I 20 125 

Thr Gly ser Leu Phe Pro He Arg Ala Gly Thr Phe Val Cys Ser His 
130 135 140 



Leu Met Tyr Gin Ala Leu Tyr Leu Val Glu Lys Lys Phe Pro Tyr Vai 

Lys Ala Gly Phe Met His He Pro Tyr Met Met Glu Gin Val Val Asn 

a „ 170 175 

Arg Pro Thr Thr Pro Thr Met Ser Leu Val Asp He Arg Arg Gly He 



Glu Ala Ala lie Gly Ala Met He Glu His Gly Asp Gin Glu Leu Thr 

195 200 2 05 

Leu Val Gly Gly Glu He His 

210 215 
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(2) INFORMATION FOR SEQ ID NO: 27 5: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 



Met Leu Val Pro Lys Arg Val Lys His Arg Arg Glu Phe Arg Gly Lys 

15 10 15 

Met Arg Gly Glu Ala Lys Gly Gly Lys Glu Val Ala Phe Gly Glu Tyr 

20 25 30 

Gly Leu Gin Ala Thr Thr Ser His Trp lie Thr Asn Arg Gin lie Glu 

35 40 45 

Ala Ala Arg lie Ala Met Thr Arg Tyr Met Lys Arg Gly Gly Lys Val 

50 55 60 

Trp lie Lys lie Phe Pro His Lys Ser Tyr Thr Ala Lys Ala lie Gly 
65 70 75 80 

Val Arg Met Gly Ser Gly Lys Gly Ala Pro Glu Gly Trp Val Ala Pro 

85 90 95 

Val Lys Arg Gly. Lys Val Met Phe Glu lie Ala Gly Val Ser Glu Glu 

100 105 110 

lie Ala Arg Glu Ala Leu Arg Leu Ala Ser His Lys Leu Pro Val 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 205 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

Met Gly Trp Trp Arg Glu Thr lie Asp He Val Lys Glu Asn Asp Pro 
15 10 15 
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Ala Ala Arg Thr Thr Leu Glu Val Leu Leu Thr Tyr Pro Gly Val Lys 

20 25 30 

Ala Leu Ala Ala His Arg Leu Ser His Phe Leu Trp Lys Tyr Asp Phe 

35 40 45 

Lys Leu Leu Ala Arg Met His Ser Gin Phe Trp Arg Phe Trp Thr Gin 

50 55 60 

lie Glu He His Pro Gly Ala Gin He Asp Ser Gly Val Phe He Asp 
65 70 4 75 80 

His Gly Ser Gly Leu Val He Gly Glu Thr Ala He Val Glu Lys Gly 

85 90 95 

Val Leu Leu Tyr His Gly Val Thr Leu Gly Gly Thr Gly Lys Asp Cys 

100 105 no 

Gly Lys Arg His Pro Thr Val Arg Lys Gly Ala Leu He Ser Ala His 

115 120 125 

Ala Gin Val He Gly Pro Val Glu He Gly Glu Asn Ala Lys Val Gly 

130 135 140 

Ala Ala Ala Val Val Val Ala Asp Val Pro Ser Asp Val Thr Val Val 
145 150 155 160 

Gly lie Pro Ala Lys He Val Arg Leu His Gly Lys Lys Asp Glu Pro 

165 170 175 

Val He His Glu Val Glu Glu Lys Arg Glu Tyr Tyr Val Asn Lys Leu 

180 i 8 5 190 

Glu Gin Ala Lys Asp Ala Ser His Arg Ser Ser Gly Leu 
i95 200 205 

(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

Met Leu Phe Tyr Leu Leu Arg Asp Gly Lys Gly Leu Arg Asn Tyr Leu 

1 5 10 15 

Thr Gin Phe He. Pro Ser Lys Leu Lys Glu Pro Val Gly Gin Val Leu 

20 25 30 

Ser Asp Val Asn Gin Gin Leu Ser Asn Tyr Val Arg Gly Gin Val Thr 
35 40 45 
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Val Ala lie He Val Ala Val Met Phe He He Phe Phe Lys He He 

50 55 60 

Gly Leu Arg Tyr Ala Val Thr Leu Gly Val Thr Ala Gly He Leu Asn 
65 70 75 80 

Leu Val Pro Tyr Leu Gly Ser Phe Leu Ala Met Leu Pro Ala Leu Val 

85 90 95 

Leu Gly Leu He Ala Gly Pro Val Met Leu Leu Lys Val Val He Val 

100 105 no 

Phe He Val Glu Gin Thr He Glu Gly Arg Phe Val Ser Pro Leu He 

H5. ' 120 125 

Leu Gly Ser Gin Leu Asn He His Pro He Asn Val Leu Phe Val Leu 

130 135 140 

Leu Thr Ser Gly Ser Met Phe Gly He Trp Gly Val Leu Leu Gly He 
145 150 155 160 

Pro Val Tyr Ala Ser Ala Lys Val Val He Ser Ala He Phe Glu Trp 

165 170 175 

Tyr Lys Val Val Ser Gly Leu Tyr Glu Leu Glu Gly Glu Glu Val Lys 
180 185 190 

Ser Glu Gin 
195 

(2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 8: 

Met Thr Lys Gin Val Pro Lys Phe Thr Lys Asp Thr Ala Gin Leu Tyr 

15 10 15 

Thr Cys Lys Trp Leu Leu Tyr Asn Lys Val Thr Lys Met Tyr Asp His 

20 25 30 

Thr Val Val Asn His Ser Val Arg Glu Tyr lie Thr Asp Ser lie Ser 

35 40 45 

Thr Asn Thr lie Glu Glu Thr Gly 
50 55 

(2) INFORMATION FOR SEQ ID NO: 279: 



224 



WO 97/37026 PCT/US97/05306 

(i.) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 153 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:279: 

Met Lys Lys Arg Ala lie Val Ala Val lie Val Leu Leu Leu lie Gly 

1 5 10 15 

Leu Asp Gin Leu Val Lys Ser Tyr He Val Gin Gin He Pro Leu Gly 

20 25 30 

Glu Val Arg Ser Trp He Pro Asn Phe Val Ser Leu Thr Tyr Leu Gin 

35 40 45 

Asn Arg Gly Ala Ala Phe Ser He Leu Gin Asp Gin Gin Leu Leu Phe 

50 55 60 

Ala Val lie Thr Leu Val Val Val He Gly Ala He Trp Tyr Leu His 
65 7 0 75 so 

Lys His Met Glu Asp Ser Phe Trp Met Val Leu Gly Leu Thr Leu He 

85 90 95 

He Ala Gly Gly Leu Gly Asn Phe He Asp Arg Val Ser Gin Gly Phe 

100 105 no 

Val Val Asp Met Phe His Leu Asp Phe He Asn Phe Ala He. Phe Asn 

115 120 125 

Val Ala Asp Asn Tyr Leu Thr Val Gly Val He He Leu Leu He Ala 

I 30 135 140 

Met Leu Lys Glu Glu He Asn Gly Asn 
145 150 

(2) INFORMATION FOR SEQ ID NO: 280 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:280: 
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Met Gly Ala Tyr Tyr Leu Val Met Gin Ser Leu Ser Tyr Leu Glu Tyr 

1 5 10 15 

Glu Gin Gly lie Gin Ser Thr. Thr Val Arg His Leu lie Leu Val Phe 

20 25 30 

Tyr Leu Leu Phe Phe Met Gly Gly lie Lys Lys Leu Asp Thr Tyr Leu 

35 40 45 

Lys Glu Lys Leu Gin Glu Glu Leu Asn Gin Glu Gin Thr Leu Arg Tyr 

50 55 60 

Arg Asp Met Glu Arg Tyr Ser Arg His lie Glu Glu Leu Tyr Lys Glu 
65 70 75 80 

lie Arg Ser Phe Arg His Asp Tyr Thr Asn Leu Leu Thr Thr Tyr Val 

85 90 95 

Trp Ala Leu Lys Arg Arg lie Trp Ser Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 281: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 81: 

Met His Val Arg Met lie Pro Lys Ser Thr Pro Asp Thr Lys Phe Ala 

1 5 10 15 

Asp Val Ala Thr His Gin Pro Glu Tyr Ser Arg Asp Asn Val Ala Gly 

20 25 30 

Thr lie Val Gly Phe Trp Thr Pro Glu lie Phe His Gly Val Ser Val 

35 40 45 

Ala Gly Tyr . His Leu His Phe lie Ser Asp Asp Leu Thr Phe Gly Gly 

50 55 60 

His Val Met Asp Phe Val lie Lys Glu Gly He He Glu Val Gly Ala 
65 70 75 80 

Val Asp Gin Leu Asp Gin Arg Phe Pro Val Gin Asp Arg Gin Tyr Leu 

85 90 95 

Phe Ala Lys Phe Asn Val Asp Glu Met Lys Lys Asp He Glu Lys Ala 
100 105 110 

Glu 
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(2) INFORMATION FOR SEQ ID NO: 282: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:282: 

Met Ala Val Ala Lys Gly Lys Leu Thr He He Ala His Val Ala Cvs 

1 5 10 15 

Asn Asn Thr Lys Asp Ser Met Glu Leu Ala Arg His Ala Glu Ser Leu 

Gly Val Asp Ala lie Ala Thr Asp Ser Thr Asn Leu Phe Pro Leu Ala 

35 4 ° 45 

Arg He Thr Gin Leu Pro Asn Thr Gly Thr He Ser Val Leu Gin Leu 

50 55 60 

Gin Thr Gin Thr Thr 
65 



(2) INFORMATION FOR SEQ ID NO: 2 83: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 284 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 

Met Gly Glu Thr Gin lie He Asp Gly' Leu Asp Pro Glu Tyr Lys Lys 

Arg P he M et His His Tyr Asn Phe Pro Ql„ Tyr Ser Val Gly Glu Thr 

20 25 
Gly Arg Tyr Gly Ala Pro Gly Arg Arg Glu He Gly His Gly Ala Leu 

35 40 45 

Gly Glu Arg Ala Leu Ala Gin Val Leu Pro Ser Leu Glu Glu Phe Pro 



50 cc 

55 60 
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Tyr Ala lie Arg Leu Val Ala Glu Val Leu Glu Ser Asn Gly Ser Ser 
65 70 75 80 

Ser Gin Ala Ser lie Cys Ala Gly Thr Leu Ala Leu Met Ala Gly Gly 

85 90 95 

Val Pro lie Lys Ala Pro Val Ala Gly lie Ala Met Gly Leu lie Ser 

100 105 110 

Asp Gly Asn Asn Tyr Thr Val Leu Thr Asp lie Gin Gly Leu Glu Asp 

115 120 125 

His Phe Gly Asp Met Asp Phe Lys Val Ala Gly Thr Arg Asp Gly lie 

130 135 140 

Thr Ala Leu Gin Met Asp lie Lys lie Gin Gly lie Thr Ala Glu lie 
145 150 155 160 

Leu Thr Glu Ala Leu Ala Gin Ala Lys Lys Ala Arg Phe Glu lie Leu 

165 170 175 

Asp Val lie Glu Ala Thr lie Pro Glu Val Arg Pro Glu Leu Ala Pro 

180 185 190 

Thr Ala Pro Lys lie Asp Thr lie Lys lie Asp Val Asp Lys lie Lys 

195 200 205 

lie Val He Gly Lys Gly Gly Glu Thr He Asp Lys He He Ala Glu 

210 215 220 

Thr Gly Val Lys He. Asp He Asp Glu Glu Xaa Asn Val Phe Tyr Leu 
225 230 235 240 

Leu Leu Val Asp Gin Asn Ala He. Asn Pro Cys Pro Lys Lys Leu Leu 

245 250 255 

Leu Val Trp Phe Arg Glu Pro Lys Trp Met Lys Phe Thr Val Leu Asn 

260 265 270 

Arg Ser Tyr Arg Glu Phe Gly Ala Phe Val Thr Leu 
275 280 

(2) INFORMATION FOR SEQ ID NO: 2 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 84: 

Met He Asn Asn Val Val Leu Val Gly Arg Met Thr Arg Asp Ala Glu 
15 10 15 
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Leu Arg. Tyr Thr Pro Ser Asn Val Ala Val Ala Thr Phe Thr Leu Ala 

20 25 3 0 

Val Asn Arg Thr Phe Lys Ser Gin Asn Gly Glu Arg Glu Ala Asp Phe 

35 40 45 

lie Asn Val Val Met Trp Arg Gin Gin Ala Glu Asn Leu Ala Asn Trp 

50 55 60 

Ala Lys Lys Gly Ser Leu lie Gly Val Thr Gly Arg lie Gin Thr Arg 



65 70 75 



80 



Ser Tyr Asp Asn Gin Gin Gly Gin Arg Val Tyr Val Thr Glu Val Val 

85 90 95 

Ala Glu Asn Phe Gin Met Leu Glu Ser Arg Ser Val Arg Glu Gly His 

ioo 105 11Q 

Thr Gly Gly Ala Tyr Ser Ala Pro Thr Ala Asn Tyr Ser Ala Pro Thr 

115 120 125 

Asn Ser Val Pro Asp Phe Ser Arg Asn Glu Asn Pro Phe Gly Ala Thr 
130 135 14Q 

Asn Pro Leu Asp lie Ser Xaa Asp Asp Leu Pro Phe 
1*5 150 155 

(2) INFORMATION FOR SEQ ID NO: 285: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 

Met Asp Leu Val Arg lie Leu Lys Glu Pro Arg Asn Ala Leu Val Lys 

1 5 10 15 

Gin Tyr Gin Thr Leu Leu Ser Tyr Asp Asp Val Glu Leu Glu Phe Asp 
20 25 30 



Asp 


Glu 


Ala 


Leu 


Gin Glu lie Ala 


Asn 


Lys 


Ala 


He 


Glu 


Arg 


Lys 


Thr 


Gly 




35 




40 










45 






Ala 
50 


Arg 


Gly 


Leu Arg Ser lie 


He 


Glu 


Glu 


Thr 


Met 


Leu 


Asp 


Val 








55 








60 










Met 

65 

Lys 


Phe 
Glu 


Glu 
Thr 


Val 
Val 


Pro Ser Gin Glu 
70 

Asp Gly Thr Asp 


Asn 
Lys 


Val 
Pro 


Lys 

75 

He 


Leu 
Leu 


Val 
Glu 


Arg 
Thr 


He 
Ala 


Thr 
80 



85 90 



95 
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(2) INFORMATION FOR SEQ ID NO: 286: 



PCT/US97/05306 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 86: 



Met Thr Glu Ala Gly Tyr Val Gly Glu Asp Val Glu Asn lie Leu Leu 

15 10 15 

Lys Leu Leu Gin Val Ala Asp Phe Asn lie Glu Arg Ala Glu Arg Gly 

20 25 30 

lie lie Tyr Val Asp Glu lie Asp Lys lie Ala Lys Lys Ser Glu Asn 

35 40 45 

Val Ser lie Thr Arg Asp Val Ser Gly Glu Gly Val Gin Gin Ala Leu 

50 55 60 

Leu Lys lie He Glu Gly Thr Val Ala Ser Val Pro Pro Gin Gly Gly 
65 70 75 80 

Arg Lys His Pro Gin Gin Val Asp Asp Ser Ser Gly Tyr Lys Lys Tyr 

85 90 95 

Pro Leu His Arg. Gly Trp Cys Phe 
100 



(2) INFORMATION FOR SEQ ID NO: 2 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:287: 

Met Trp Pro Arg Glu Ser Arg Ser Cys Arg Leu Thr Lys Val Lys He 
1 5 10. 15 
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Cys Gly Leu Ser Thr Lys Glu Ala Val Glu Thr Ala Val Ser Ala Gly 

20 25 30 

Ala Asp Tyr lie Gly Phe Val Phe Ala Pro Ser Lys Arg Gin Val Thr 

35 40 45 

Leu Glu Glu Ala Ala Glu Leu Ala Lys Leu He Pro Ala Asp Val Lys 

50 55 60 

Lys Val Gly Val Phe Val Ser Pro Ser Arg Val Glu Leu Leu Glu Ala 

65 70 7 5 80 

He Asp Lys Val Gly Leu Asp Leu Val Gin val His Gly Gin Val Ala 

85 90 95 

Asp Asp Leu Phe Glu Asn Leu Pro Cys Ala Ser He Gin Ala Val Gin 

100 105 110 

Val Asp Gly Asn Gly His Val Pro Asn Ser Gin Ala Asp Tyr Leu Leu 

115 120 125 

Phe Asp Ala Pro Val Ala Gly ser Gly Gin Ser Phe Asp Trp Gly Gin 

130 135 140 

Leu Asp Thr Thr Gly Leu Ala Gin Pro Phe Phe He Ala Gly Gly Leu 

155 160 
Asn Glu Asp Asn Val Val Lys Ala He Gin His Phe Thr Pro Tyr Ala 



165 



170 



175 



Val Asp Val Ser Ser Gly Val Glu Thr Asp Gly Gin Lys Asp His Glu 
180 

Lys He Arg Arg Phe He Glu Arg Val Lys His Gly He s.r Gly Thr 

205 



19 5 200 



Lys 



(2) INFORMATION FOR SEQ ID NO: 288 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 288: 

Met Asn Cys Glu Ala Val Ala Leu Gly Ser Phe Cys Glu Leu Lys Ser 

5 10 15 

Arg Arg Gly Lys Lys Gin Lys Gly Glu He Leu Met Ala Val He Ser 
20 2«5 
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Met Lys Gin Leu Leu Glu Ala Gly Val His Phe Gly His Gin Thr Arg 

35 40 45 

Arg Trp Asn Pro Lys Met Ala Lys Tyr He Phe Thr Glu Arg Asn Gly 

50 55 60 

He His Val He Asp Leu Gin Gin Thr Val Lys Tyr Ala Asp Gin Ala 
65 70 75 80 

Tyr Xaa Phe Met Arg Asp Ala Ala Ala Asn Asp Ala Val Val Leu Phe 

85 90 95 

Val Gly Thr Lys Lys Thr Ser Ser 
. 100 

(2) INFORMATION FOR SEQ ID NO:289: 



(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 89: 

Met Val Asp Thr Asn Thr Asp Pro Asp Asp He Asp Val He He Pro 

1 " < m . 5 10 15 

Ala Asn Asp Asp Ala He Arg Ala Val Lys Leu He Thr Ala Lys Leu 

20 25 30 

Ala Asp Ala He He Glu Gly Arg Gin Gly Glu Asp Ala Val Ala Val 

35 40 45 



(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 



Met Glu Glu Leu Gly Gin Ser Tyr Gly Tyr Leu Leu Tyr Arg Thr Glu 
1 5 .10 15 
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Thr Asn. Trp Asp Ala Glu Glu Glu Arg Leu Arg lie He Asp Gly Arg 

20 25 3 0 

Asp Arg Ala Gin Leu Tyr Val Asp Gly Gin Trp Val Lys Thr Gin Tyr 

35 40 45 

Gin Thr Glu He Gly Glu Asp He Phe Tyr Gin Gly Lys Lys Lys Gly 

Leu Ser Arg Leu Asp He Leu He Glu Asn Met Gly Arg Val Asn Tyr 
" 70 75 80 

Gly His Lys Phe Leu Ala Asp Thr Gin Arg Lys Gly He Arg Thr Gly 

85 *0 95 

Val Cys Lys Asp Leu His Phe Leu Leu Asn Trp Lys His Tyr Pro Leu 

ioo 105 11Q 

Pro Leu Asp Asn Pro Glu Lys He Asp Phe Ser Lys Gly Trp Thr Gin 

115 12 ° 125 

Gly Gin Pro Ala Phe Tyr Ala Tyr Asp Phe Thr Val Glu 
130 "5 140 

(2) INFORMATION FOR SEQ ID NO: 291 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 
Men Gly Lys Glu Lys Val Trp Leu Pro Arg Lys Gin Gly Leu Pro Gly 

5 io 15 

Leu Ala Tyr Tyr Val He Glu Val Ala His Lys Glu Glu Leu Leu Thr 

20 25 30 

He Ala Gin Arg Ala Gin Glu Val Asp Val Pro He Lys Trp Met Th^ 

35 40 45 

Ser ser Gin Leu Glu He Thr Asp Ser Asp Gly He Val Thr Cys He 



50 

Arg Leu Ala Arg 

.65 



(2) INFORMATION FOR SEQ ID NO:292 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 5 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:292: 

Met Asn Lys Lys Gin Trp Leu Gly Leu Gly Leu Val Ala Val Ala Ala 

15 10 15 

Val Gly Leu Ala Ala Cys Gly Asn Arg Ser Ser Arg Asn Ala Ala Ser 

20 25 30 

Ser Ser Asp Val Lys Thr Lys Ala Ala lie Val Thr Asp Thr Gly Gly 

35 40 45 

Val Asp Asp Lys Ser Phe Asn Gin Ser Ala Trp Glu Val Ala Gly Leu 
50 55 60 

Gly 
65 

(2) INFORMATION FOR SEQ ID NO: 2 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:293: 

Met Ser Gin Trp Asp Arg Lys Leu Asp Ala Arg Leu Ala Gin Ala Val 

15 10 15 

Val Ser lie Asn Ala Phe Lys Gly Val Glu Phe Gly Leu Gly Phe Glu 

20 25 30 

Ala Gly Tyr Arg Lys Gly Ser Gin Val Met Asp Glu lie Leu Trp Ser 

35 40 45 

Lys Glu Asp Gly Tyr Thr Arg Arg Thr Asn Asn Leu Gly Gly Phe Glu 

50 55 60 

Gly Gly Met Thr Asn Gly Gin Pro lie Val Val Arg Gly Val Met Lys 

65 70 75 80 

Pro lie Pro Thr Leu Tyr Lys Pro Leu Met Ser Val Asp lie Glu Thr 
85 90 95 
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His Glu Pro Tyr Lys Ala Thr Val Glu Arg Ser Asp Pro Thr Ala Leu 

100 105 no 

Pro Ala Ala Gly Met Val Met Glu Ala Val Val Ala Thr Val Leu Ala 

115 12 5 
Gin Glu He Leu Glu Lys Phe Ser Ser Asp Asn Leu Glu Glu Leu Lys 

130 135 140 

Glu Ala Val Ala Lys His Arg Asp Tyr Thr Lys Asn Tyr 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 

Met Glu Asp Ser Asn Ser Phe Met Leu Ser His Gin Met Glu Thr He 

15 . 

Leu Lys Glu Ala Gly Phe Thr Lys Ala Val Ser Tyr Phe He Leu Glu 



20 25 



Leu Lys Asp Pro Ser Gin Thr Lys Val Val Thr Glu Glu Leu Gin Lys 

35 40 45 

Asn Lys Lys Tyr Thr Val Leu Ser 
50 55 



(2) INFORMATION FOR SEQ ID NO: 29 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single. 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 
Met Gly Lys Lys Arg Trp Ala. Arg Asn Gly Phe Glu Ser Asn Asp Ala 



10 15 
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Ser Tyr Ala Gin 
20 

Ser Asn Asn Glu 
35 

Thr Asn Glu Gin 
50 

Glu 
65 



Val Val Ser Leu 

Thr Asp Lys Val 
40 

Gly Leu Thr lie 

55 



Tyr Asp Asp Thr 
25 

Leu Ala Gly Ser 

Pro Ser Ser Phe 
60 



PCT/US97/05306 

Ser He Ser Val 
30 

Leu Tyr Thr Asp 
45 

Thr Lys Lys Leu 



(2) INFORMATION FOR SEQ ID NO: 296: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 

Met Ala Met Glu Ser Gly Ala Thr Ala Val Val Ala Glu Arg Gly Gin 

1 5 .10 15 

Glu Arg lie Thr Lys Val Arg Glu He Leu Gly Gly Gly Ala Asp Ala 

20. 25 30 

Ala Leu Glu Cys Val Gly Thr Glu Ala Ala He Glu Gin Ala Leu Gly 

35 40 45 

Val Leu His Asn Gly Gly Arg Met Gly Phe Val Gly Val Pro His Tyr 

50 55 60 

Asn Asn Arg Ala Leu Gly Ser Thr Phe Met Gin Asn He Ser Val Ala 
65 70 75 80 

Gly Gly Ala Ala Ser Ala Thr Thr Tyr Asp Lys Gin Phe Leu Leu Lys 

85 90 95 

Ala Val Leu Asp Gly Asp He Asn Pro Gly Arg Val Phe Thr Ser Ser 

100 105 110 

Tyr Lys Leu Glu Asp He Asp Gin Ala Tyr Lys Asp Met Asp Glu Arg 

115 120 125 

Lys Thr He Lys Ser Met He Val He Glu 
130 135 



(2) INFORMATION FOR SEQ ID NO: 297: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 130 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 97: 

Met Lys Gin Val Val Asp Tyr Phe Leu Ser Gin Gly Met Asp Arg He 

1 ' 5 10 15 

Gly lie Leu Thr Gly Leu Glu Glu Thr Thr Asp Gin Glu Glu He H e 

20 25 30 

Gin Asp Lys Arg Leu Glu Asn Phe Lys Asn Tyr Ser Gin Ala Arg Gly 

35 4° 45 

He Tyr His Asp Glu Leu Val Phe Gin Gly Arg Phe Thr Ala Gin Ser 



50 " 60 



Gly Tyr Asp Leu Met Lys Glu Ala He oln S er Leu Gly Asp Gin Leu 

f 70 75 80 

Pro . Pro Ala Phe Phe Ala Ala Ser Asp Ser Leu Ala He Gly Ala Leu 



85 90 



Arg Ala Leu Gin Glu Ala Gly He Ser Leu Pro Asp Arg Val Ser Leu 

100 105 110 

He Phe Leu Leu Thr Thr Leu Ser Leu Thr Lys Gin Val Tyr Pro Pro 
115 120 125 

Leu Ser 
130 



(2) INFORMATION FOR SEQ ID NO: 29 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 
Met: Ala Lys Val Thr lie Met Leu Ala Cys Ala Ala Gly Met Ser Thr 
Ser Leu Leu Val Thr Lys Met Gin Lys aL Ala Glu Asp Lys Gly Leu 



20 



25 



30 
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Asp Ala Glu lie Phe Ala Val Pro Ala Pro Glu Ala Glu Glu lie Val 

35 40- 45 

Ala Thr Lys Glu Val Asn Val Leu Leu Leu Gly Pro Gin Val Arg Tyr 

50 55 60 

Leu Leu Gly Asp Phe Gin Glu Lys Leu Lys Asp Arg Gin lie Pro Val 
65 70 75 80 

Ala Val lie Pro Met Thr Asp Tyr Gly Met Met Asn Gly Ser Lys Val 

85 90 95 

Leu Asp Leu Ala Glu Ser Leu Leu Asp 
100 105 

(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 127 amino acids 
(B> TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 

Met His Gly Ala Gly Val Asn Glu Pro Val Ala Glu Leu Ser Val Ala 

1 ' .• 5 10 15 

Glu Gin Leu Leu Glu Ala Gly Ala Asp Val lie Leu Val Pro Ala Val 

20 25 30 

Gly Thr Val Pro Ala Phe His Asp Gin Glu Leu Arg Glu Val Val Asp 

35 40 45 

Leu Val His Ser Lys Gly Gly Leu Val Leu Ser Ala lie Gly Thr Ser 

50 55 60 

Gin Glu Thr Ser Asp Thr Asp Thr lie Lys Glu lie Ala Leu Arg Asn 
65 70 75 80 

Lys lie Cys Gly Val Asp lie Gin His lie Gly Asp Ala Gly Tyr Gly 

85 90 95 

Gly Leu Ala Thr Val Asp Asn lie Tyr Ala Leu Ser Lys Ala He Arg 

100 105 110 

Gly Val Arg His Thr Val Ser Arg Leu Ala Arg Ser Val Asn Arg 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 3 00: 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 23 6 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 00: 

Met Ser Ala lie Asn Leu Ala Ser Val Pro Lys Glu Ser Leu Thr Gin 

15 10 is 

Val Leu Pro Arg Asp Leu His Ala Glu Tyr Phe Ala Val Leu Ala Ser 

20 25 30 

He Ala Thr Ser He Glu Arg Met Ala Thr Glu He Arg Gly Leu Gin 

35 40 45 

Lys Ser Glu Gin Arg Glu Val Glu Glu Phe Phe Ala Lys Gly Gin Lys 

50 55 60 

Gly Ser Ser Ala Met Pro His Lys Arg Asn Pro He Gly Ser Glu Asn 
65 70 75 80 

Met Thr Gly Leu Ala Arg Val He Arg Gly His Met He Thr Ala Tyr 

85 90 95 

Glu Asn Val Ala Leu Trp His Glu Arg Asp He Ser His Ser Ser Ala 

100 105 no 

Glu Arg He He Thr Pro Asp Thr Thr He Leu He Asp Tyr Met Leu 

115 120 125 

Asn Arg Phe Gly Asn He Val Lys Asn Leu Thr Val Phe Pro. Glu Asn 

130 135 140 

Met He Arg Asn Met Asn Ser Thr Phe Gly Leu He Phe Ser Gin Arg 
145 150 155 160 

Ala Met Leu Thr Leu He Glu Lys Gly Met Thr Arg Glu Gin Ala Tyr 

165 170 175 

Asp Leu Val Gin Pro Lys Thr Ala Tyr Ser Trp Asp Asn Gin Val Asp 

180 185 190 

Phe Lys Pro Leu Leu Glu Ala Asp Ser Glu Val Thr Ser Arg Leu Thr 

195 200 205 

Gin Glu Glu He Asp Glu He Phe Asn Pro Val Tyr Tyr Thr Lys Arg 

210 215 220 

Val Asp Asp He Phe Glu Arg Leu Gly Leu Gly Asp 
225 230 235 



(2) INFORMATION FOR SEQ ID NO: 3 OL- 
ID SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



POT/IJS97/©S306 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 01: 

Met Leu Asn Leu Thr His Val Thr Leu Lys Thr Arg Gin Val lie Leu 

1 5 10 is 

Gin Asp Ala Asp Phe Thr Phe Lys Lys Gly Arg lie Tyr Gly Leu Leu 

20 25 30 

Ala He Asn Gly Ser Gly Lys Thr Thr Leu Phe Arg Ala Met Ser Lys 

35 40 45 

Leu Leu Pro Leu Ser Ser Gly His He Ala Val Pro Pro Ser Leu Phe 

50 55 60 

Tyr Tyr Glu Ser Val Glu Trp Leu Asp Gly Asn Leu Ser Gly Met Asp 
65 70 75 80 

Tyr Leu Arg Leu He Lys Lys His Leu Glu Val Arg Pro Lys Leu Glu 
85 90 95 

Arg 



(2) INFORMATION FOR SEQ ID NO: 3 02: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TO POLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 

Met Tyr Thr Leu Gly Gin Pro Asn Glu Leu Ala Ala Glu Phe Leu Asn 

15 10 15 

Phe Val Leu Ser Asp Glu Thr Gin Glu Gly lie Val Lys Gly Leu Lys 

20 25 30 

Tyr He Pro He Lys Glu Met Lys Val Glu Lys Asp Ala Ala Gly Thr 

35 40 4 5 

Val Thr Val Leu Glu Gly Arg Gin 
50 55 



(2) INFORMATION FOR SEQ .ID NO: 3 03: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303 : 

Met Asn Gin Glu Glu Leu Ala Lys Lys Met Leu Leu Pro Ser Lys Asn 

Ser Arg Leu Glu Lys Leu Gly Lys Gly Leu Thr Phe Ala Cys Leu Ser 

20 25 30 

Leu lie val He Leu Val Ala Met He Leu val Phe Val Ala Gin Lys 

35 40 45 

Gly Leu Ser Thr Phe Phe Val Asn Gly Val Asn lie Phe Asp Phe Leu 

50 55 60 

Leu Gly Gly Thr Trp Asn Pro Ser Ser Lys Glu Phe Gly Ala Leu Pro 
" 70 75 80 



Met He Leu Gly Ser Phe He Val Thr He Leu Ser Al 



85 90 



a Leu He Ala 



Thr Pro Phe Ala He Gly Ala Ala Val Phe Met Thr Glu Val Ser Pro 

100 105 110 

Lys Gly Ala Lys He Leu Gin Pro Ala He Glu Leu Leu Val Gly He 

115 ■ 120 125 

Pro Ser Val Val Tyr Gly Phe He Gly Leu Gin Val Val Val Pro Phe 

130 135 140 

Val Arg Ser Val Phe Gly Gly Thr Gly Phe Gly He Leu Ser Gly He 
145 150 155 160 



Ser Val Leu Phe Val Met He Leu Pro Thr Val Thr Phe Met Thr Thr 

165 ' 170 175 

Asp ser Leu Arg Ala Val Pro Xaa Leu Leu Ser 
180 185 



(2) INFORMATION FOR SEQ ID NO:304 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 242 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 



1PCT/US97/053<B6 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

Met Thr Glu lie Arg Leu Glu His Val Ser Tyr Ala Tyr Gly Gin Glu 

1 5 10 15 

Arg lie Leu Glu Asp He Asn Leu Gin Val Thr Ser Gly Glu Val Val 

20 25 30 

Ser He Leu Gly Pro Ser Gly Val Gly Lys Thr Thr Leu Phe Asn Leu 

35 - 40 45 

He Ala Gly He Leu Glu Val Gin Ser Gly Arg He Val Leu Asp Gly 

50 55 60 

Glu Glu Asn Pro Lys Gly His Val Ser Tyr Met Leu Gin Lys Asp Leu 
65 70 75 80 

Leu Leu Glu His Lys Thr Val Leu Gly Asn He He Leu Pro Leu Leu 

85 90 95 

He Gin Lys Val Asp Lys Ala Glu Ala He Ser Arg Ala Asp Lys He 

100 105 110 

Leu Ala Thr Phe Gin Leu Thr Ala Val Arg Asp Lys Tyr Pro His Glu 

115 120 125 

Leu Ser Gly Gly Met Arg Gin Arg Val Ala Leu Leu Arg Thr Tyr Leu 

130 135 140 

Phe Gly His Lys Leu Phe Leu Leu . Asp Glu Ala Phe Ser Ala Leu Asp 
145 150 155 160 

Glu Met Thr Lys Met Glu Leu His Ala Trp Tyr Leu Glu He His Lys 

165 170 175 

Gin Leu Gin Leu Thr Thr Leu He Xaa Thr His Ser He Glu Glu Ala 

180 185 190 

Leu Xaa Leu Ser Asp Arg lie Tyr He Leu Xaa Asn Xaa Pro Gly Gin 

195 200 205 

He Val Ser Glu lie Lys Leu Asp Trp Ser Glu Asp Glu Asp Xaa Glu 

210 215 220 

Val Xaa Lys He Ala Xaa Lys Arg Gin He Leu Ala Glu Leu Gly Leu 
225 230 235 240 

Asp Lys 



(2) INFORMATION FOR SEQ ID NO: 305: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 

Met Glu His Pro xaa Glu Ala Ala Asp He Leu He Lys Asn Ala Pro 

1.5 10 15 

Glu Leu Lys Glu Lys Arg Asp Phe Val He Glu Ser Gin Lys Tyr Leu 

20 25 30 

Ser Lys Glu Tyr Ala Ser Asp Lys Glu Lys Trp Gly Gin Phe Asp Ala 

35 40 45 

Ala Arg Trp Asn Ala Phe Tyr Lys Trp Asp Lys Glu Asn Gly He Leu 

50 55 60 

Lys Glu Asp Leu Thr Asp Lys Gly Phe Thr Asn Glu Phe Val Lys 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 306 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 

Met Val Tyr Thr Ser Leu Ser Ser Lys Asp Gly Asn Tyr Pro Tyr Gin 

15 io 15 

Leu Asn He Ala His Leu Tyr Gly Asn Leu Met Asn Thr Tyr Gly Asp 

20 25 30 

Asn Gly Asn He Leu Met Leu Lys Tyr Val Ala Glu Lys Leu Gly Thr 

35 40 45 

His val Thr Val Asp He Val Ser Leu His Asp Asp Phe Asp Glu Asn 

50 55 60 

His Tyr Asp He Ala Phe Phe Gly Gly Gly Gin Asp Phe Glu Gin Ser 
65 70 75 80 

He He Ala Asp Asp Leu Pro Ala Lys Lys Glu Ser He Asp Asn Tyr 

85 go 95 

He Gin Asn Asp Gly val Val Leu Ala He Cys Gly Gly Phe Gin Leu 

100 105 no 

Leu Gly Gin Tyr Tyr Val Glu Ala Ser Gly Lys Arg He Glu Gly Leu 
115 120 12S 
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Gly Val Met Gly His 
130 

Gly Asp lie Lys lie 
145 



Tyr Thr Leu Asn Gin Thr 
135 

His Asn Glu Arg Phe Arg 
150 155 



PCT/US97/053O6 

Asn Asn Arg Phe lie 
140 



(2) INFORMATION FOR SEQ ID NO: 3 07: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 

Met His Gin His Ser Pro His Thr Ala Ser Tyr Thr Ser Ser Arg Asp 

1 5 10 15 

Ala Met Ser Lys Tyr Pro Glu Arg Cys Thr Thr Val Gly Leu Arg Leu 

20 25 30 

Asn Glu Glu Ser Asp Phe Glu Leu Tyr Ala Pro Tyr Gly Leu Glu Asp 

35 40 45 

lie Leu Asn Phe Lys Phe Val Gin Leu Leu lie Ser 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 3 08: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 

Met Phe Glu lie Phe Asn Met Gly Val Gly Leu Met Leu Ala Val Ser 

15 10 15 

Pro Glu Asn Val Glu Arg Val Lys Glu Leu Leu Asp Glu Ala Val Tyr 

20 25 30 

Glu lie Gly Arg lie Val Lys Lys Glu Asn Glu Ser Val He He Lys 
35 40 45 
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(2) INFORMATION FOR SEQ ID NO: 3 09: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ i i ) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 



Met Met Val Lys Lys His Tyr Phe Lys Asp Asn Glu Glu Asp Ser Lys 

15 10 15 

Met Arg Arg lie Leu Leu Leu Val Ala Val Leu Leu Met He Pro Ser 

20 25 30 

Phe lie Ser Ala Thr Thr Leu Val Arg Glu Thr Leu Lys Lys Glu Ser 

35 40 45 

Leu Lys Lys Phe lie Ser Glu Gin Phe Gin Gly His Asn He Leu Lys 

50 55 60 

Lys Thr Tyr Ser Lys Lys Thr His Thr Leu Lys Leu Thr He Ser Gly 
65 70 75 so 

Asn Tyr Leu Thr Glu Glu Glu Leu Asp Met He Ser Ser Lys Arg Gly 

85 90 95 

Asp Tyr Gly Leu Ser Asp Val Ser Val Glh Val Ser Gin Leu Ser Asp 

100 105 110 

Ser Glu Gin Leu Ser Lys Glu Glu Leu Val Glu Tyr Phe Phe Gin Tyr 

115 120 12 s 

He Lys Asp Lys Glu Ala Lys Glu Lys Glu Lys Ala Asn Lys Phe Tyr 

130 135 140 

Thr Glu Ser Glu Glu Gin 
145 iso 

(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 



PCT/US97/0S306 



Met Ser lie Glu Pro Arg Arg Arg Trp Arg Asn Ser Thr Val Cys Leu 

15 10 15 

Glu Ala Glu Leu Tyr Gin Glu Ser Leu Val Leu lie Gly Gly Asp Pro 

20 25 30 

Gly lie Gly Lys Ser Thr Leu Leu Leu Gin Val Ser Thr Gin Leu Ser 

35 40 45 

Gin Val Gly Thr Val Leu Tyr Val Ser Gly Glu Glu Ser Ala Gin Gin 

50 55 60 

lie Lys Leu Arg Ala Glu Arg Leu Gly Asp Phe Asp Ser Glu Phe Tyr 
65 70 75 * 80 

Leu Tyr Ala Glu Thr Asn Met Gin Ser Val Arg Ala Glu Val Val Ala 

85 90 95 

Tyr Pro Ala Arg Leu Ser His Tyr 
100 

(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 

Met Ala Thr Lys Gin Lys Glu Val Thr Thr Phe Asp Val Gin Val Ala 

15 10 15 

Glu Phe lie Arg Asn His Lys Gin Lys Gly Thr Ala Thr Asp Asp Glu 

20 25 30 

lie Asn Ala Ser Leu Val lie Pro Phe Thr Leu Asp Ala Asp Gly lie 

35 40 45 

Glu Asp Leu Leu Gin Arg lie Gin Asp Ala Gly lie Ser lie Thr Asp 

50 55 60 

Asn Glu Gly Asn Pro Ser Ala Arg Val Leu Ser Asn Glu Glu Glu Pro 
65 70 75 80 

Glu Leu Ser Asp Glu Asp Leu lie Gly Ser Thr Ser Ala Lys Val Asn 

85 90 . 95 

Asp Pro Val Arg Met Tyr Leu Lys Glu lie Gly Val Val Pro Leu Leu 
100 105 110 
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Thr Asn Glu Glu Glu Lys Glu Leu Ala Leu Ala Val Glu Ala Gly Asp 

115 120 125 

lie Glu Ala Lys Gin Arg Leu Ala Glu Ala Asn Leu Arg Leu Val Val 

130 135 140 

Ser He Ala Lys Arg Tyr Val Gly Arg Gly Xaa Gin Ser Leu Thr 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 312: 
Met Met Leu Lys Pro Ser lie Asp Thr Leu Leu Asp Lys Val Pro Ser 



1 5 io 



15 



Lys Tyr Ser Leu Val He Leu Glu Ala Lys Arg Ala His Glu Leu Glu 



20 25 



30 



Ala Gly Ala Pro Ala Thr Gin Gly Phe Lys Ser Glu Lys Ser Thr Leu 
35 4 0 ._ 



45 



Arg Ala Leu Glu Glu He Glu Ser Gly Asn Val Thr He His Pro Asp 

50 55 60 

Pro Glu Gly Lys Arg Glu Ala Val Arg Arg Arg He Glu Glu Glu Lys 
f 70 75 80 

Arg Arg Lys Glu Glu Glu Glu Lys Lys He Lys Glu Gin He Ala Lys 

" 90 95 

Glu Lys Glu Asp Gly Glu Lys He 

100 



(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 
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Met He Arg Gin Gly Gin Met. Leu Glu Tyr Ala Glu Tyr Val Gly Asn 

15 10 15 

Tyr Tyr Gly Thr Pro Leu Thr Tyr Val Asn Glu Thr Leu Asp Lys Gly 

20 25 30 

He Asp Val Phe Leu Glu lie Glu Val Gin Gly Ala Leu Gin Val Lys 

35 40 45 

Lys Lys Val Pro Asp Ala Val Phe He Phe Leu Thr Pro Pro Asp Leu 

50 55 60 

Asp Glu Leu Gin Glu Arg Leu Val Gly Arg Gly Thr Asp Ser Ala Glu 
65 70 75 80 

Val lie Ala Gin Arg He Glu Lys Ala Lys Glu Glu He Ala Leu Met 

85 90 95 

Arg Glu Tyr Asp Tyr Ala He Val Asn Asp Gin Val Pro Leu Ala Ala 

100 105 110 

Glu Arg Val Lys Cys Val He Glu Ala Glu His Phe Cys Vai Asp Arg 

115 120 125 

Val lie Gly His Tyr Gin Glu Met Leu Pro Lys Ser Pro Thr Thr Arg 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 14: 

Met Ser Arg Leu Leu Val He Gly Cys Gly Gly Val Ala Gin Val Ala 

15 10 15 

He Ser Lys He Cys Gin Asp Ser Glu Thr Phe Thr Glu He Met He 

20 . 25 30 

Ala Ser Arg Thr Lys Ser Lys Cys Asp Asp Leu Lys Ala Lys Leu Glu 

35 40 45 

Gly Lys Thr Ser Thr Lys He Glu Thr Ala Ala Leu Asp Ala Asp Lys 

50 55 60 

Val Glu Glu Val He Ala Leu He Glu Ser Leu Gin Thr Ser Lys Leu 
65 70 75 80 

Phe 
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(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 15: 

Met His Glu Ser Phe His Val Asp Asp Pro Thr Leu Tyr Ser Arg Glu 

1 5 10 15 

Trp Phe Ser Trp Ala Asn Met Met Phe Cys Glu Leu Val Leu Asp Tyr 

20 25 30 

Leu Asp He Arg 

35 

(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 16: 

Met Arg Phe Tyr Phe Met Glu Asn Val Val Val His He He Se^ His 

15 10 15 

Ser His Trp Asp Arg Glu Trp Tyr Leu Pro Phe Glu Ser His Arg Met 



20 



25 



30 



Gin Leu Val Glu Leu Phe Asp Asn Leu Phe Asp Leu Phe Glu Asn Asp 
35 4 0 



45 



Pro Glu Phe Lys Ser Phe His Leu Asp Gly Gin Thr He Val Leu Asp 



Asp 
65 



50 55 60 
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(2) INFORMATION FOR SEQ ID NO: 317: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 

Met Leu Leu Tyr lie Leu Ser His lie Ala Thr Gly lie Val Ser Gly 

1 5 10 15 

Thr Cys Leu Leu Lys Ala lie Val Cys Asn Trp Trp Asn Cys Leu Thr 

20 25 30 

lie Ser Leu lie Ser Leu Lys Met Thr Leu Ser Ser Arg Val Ser Thr 

35 40 45 

Trp Met Asp Lys Leu Leu Ser Leu Thr Thr Asn Leu Gin lie Arg Pro 

50 55 60 

Glu Asn Arg Asp Lys Val Gin Arg Tyr lie Asp Glu Gly Lys Leu Lys 
65 70 75 80 

lie Gly Pro Phe Tyr lie Leu Gin Asp Asp Tyr Leu lie Ser Ser Glu 

85 90 95 

Ala Asn Val Arg Asn Thr Leu lie Gly Gin Gin Glu Ala Ala Lys Trp 

100 105 110 

Gly Lys Ser Thr Gin lie Gly Tyr Phe Pro Asp Thr Phe Gly Asn Met 

115 120 125 

Gly Gin Ala Pro Gin He Leu Gin Lys Ser Gly He His Val Ala Ala 

130 135 140 

Phe Gly Arg Gly Val Lys Pro He Gly Phe Asp Asn Gin Val Leu Glu 
145 150 155 . 160 

Asp Glu Gin Phe Thr Ser Gin Phe Ser Glu Met Tyr Trp Gin Gly Val 

165 170 175 

Asp Gly Ser Arg Val Leu Gly He Leu Phe Ala Asn Trp Tyr Ser Asn 

180 185 190 

Gly Asn Glu He Pro Val Asp Lys Asp Glu Ala Leu Thr Phe Trp Lys 

195 200 205 

Gin Lys Leu Ser Asp Val Arg Cys Leu Arg Phe Asp Gin Pro Met Val 

210 215 220 

Asp Asp Glu Thr Ala Val Asn Thr Ser Leu Ser Gin Lys Lys Ser Glu 
225 230 235 240 

Arg Ser His Ser Val Trp Gin Met Asn Ser Ser Arg Met 
245 250 
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(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

Met Thr He Val Gly Cys Arg He Asp Gly Arg Leu lie His Gly Gin 

15 10 15 

Val Ala Asn Leu Trp Ala Gly Lys Leu Asn Val Ser Arg He Met Val 

20 25 30 

Val Asp Asp Glu Val Val Asn Asn Asp He Glu Lys Ser Gly Leu Lys 



35 *0 45 



Leu Ala Thr Pro Pro Gly Val Lys Leu Ser He Leu Pro Val Glu Lys 

50 55 60 

Ala Ala Ala Asn He Leu Ala Gly Lys Tyr Asp Ser Gin Arg Leu Phe 
65 70 75 8q 

He Val Ala Arg Lys Pro Asp Arg Phe Pro Trp Phe Gly Arg Ser Arg 



85 



Cys Thr Thr 



90 



.95 



(2) INFORMATION FOR SEQ ID NO: 3 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
S (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

Met lie Gin His Pro Arg He Gly He Arg Pro Thr lie Asp Gly Arg 

5 10 15 

Arg Gin Gly Val Arg Glu Ser Leu Glu Val Gin Thr Met Asn Met Ala 

20 25 30 
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Lys Ser Val Ala Asp Leu lie Ser Ser Thr Leu Lys Tyr Pro Asp Gly 

35 40 45 

Glu Pro Val Glu Cys Val lie Ser Pro Ser Thr lie Gly Arg Val Pro 

50 55 60 

Glu Ala Ala Ala Ser His Glu Leu Phe Lys Lys Ser Asn Val Cys Ala 
65 70 75 80 

Thr lie Thr Val Thr Pro Cys Trp Cys Tyr Gly Ser Glu Thr Met Asp 

85 90 95 

Met Ser Pro Asp lie Pro His Ala lie Trp Gly Phe Asn Gly Thr Glu 

. i00 105 110 

Arg Pro Gly Ala Val Tyr Leu Ala Ala Val Leu Ala Ser His Ala Gin 

115 120 125 

Lys Gly lie Pro Ala Phe Gly lie Tyr Gly Arg Asp Val Gin Glu Ala 

130 135 140 

Asn Asp Thr Asp lie Pro Glu Asp Val Lys Glu Asn Phe 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 



Met Leu Leu Ser Thr Lys Gly lie Glu Glu Gin Glu lie Ala Arg Pro 

1 5 10 15 

Thr Leu Glu Arg Leu Phe Ser Met Arg Glu Asn Tyr Lys Val Thr Gly 

20 25 30 

Arg His Pro Gly Tyr Arg Lys Tyr Asn Gly Asp Gly Ser Met Lys Glu 

35 40 45 

Thr Glu Lys 
50 

(2) INFORMATION -FOR SEQ ID NO: 321: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

Met Leu Thr Leu Thr Pro Asp Val lie Tyr Gin Glu Pro Asp He Leu 

Tyr Gin His Glu Asp Phe Ser Leu Val Lys He Arg Gin He Arg Phe 
■ 20 25 30 

Cys 

(2) INFORMATION FOR SEQ ID NO: 3 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 

Met Ser Ser Val Val Gly Leu Met Gly Asn He Gly Gin Ala Asn Tyr 

Ala Ala Ser Lys Ala Gly Leu He Gly Phe Thr Lys Ser Val Ala Arc 

20 25 30 

Glu val Ala Ser Arg Asn He Arg Val Asn Val He Ala Pro Gly Asn 

Asp 



35 40 



(2) INFORMATION FOR SEQ ID NO: 3 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:323: 
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Met Lys Leu Glu His Lys Asn He Phe He Thr Gly Ser Ser Arg Gly 

15 10 15 

He Gly Leu Ala He Ala His Lys Phe Ala Gin Ala Gly Ala Asn He 

20 25 30 

Val Leu Asn Ser Arg Gly Ala He Ser Glu Glu Leu Leu Ala Glu Phe 

35 40 45 

Ser Asn Tyr Gly He Lys Val Val Pro He Ser Gly Asp Val Ser Asp 

50 -55 60 

Phe Ala Asp Ala Lys Arg Met He Asp Gin Ala He Ala Glu Leu Gly 
65 70 75 80 

Ser Val Asp Val Leu Val Asn Asn Ala Gly He Thr Gin Asp Thr Leu 

85 90 95 

Met Leu Lys Met Thr Glu Ala Asp Phe Glu Lys Val Leu Lys Val Asn 

100 105 110 

Leu Thr Gly Ala Phe Asn Met Thr Gin Ser Val Phe Gly Asn Arg 
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 32 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 

Met Glu Ser Met Pro Arg lie Gly Leu Leu Val Thr Val Glu Asn Arg 

15 10 15 

Asp Thr Glu Thr lie Phe Asn Ala Ala Gly Leu Asp Phe Asp Val Leu 

20 25 30 

Lys* Ala Ser Ala lie Ala Tyr lie Asn Ala Asn Thr Phe Val Gin Lys 

35 40 45 

Glu Asn Ala Gly Glu Met Gly Arg Ser Val Ser Tyr His Asp Met Arg 

50 55 60 

Ser Val 
65 

(2) INFORMATION FOR SEQ ID NO: 325: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 

Met Arg Gin Gly Gin Gly Ser Val Glu Ala He Phe Asn Ala lie Asp 
1 

Lys Phe Phe Asn Gin Ser Val Arg Leu Val Ser Tyr Thr He Asn Ala 



20 25 



30 



Val Thr Asp Gly He Asp Ala Gin Asp Arg Val Val Gly His Cys 

45 



35 40 



(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:326: 
Met Val Glu Asn Pro Glu Gly Phe His Phe Asp Asp Leu Gin Leu Gin 

1 5 io 15 

Thr His Ala Asp Asn Asp He Glu Ala Leu Val Ser Leu Ala Asn Met 



20 25 



30 



Asp Gly Glu Lys Val Glu Phe Asn Ala Thr Gly Thr Gly Phe Arg 
35 « 45 

(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:327: 

Met Asn Phe Gin Leu Ala Lys Tyr Ser Leu Leu Lys Lys Phe Ser Glu 

15 10 15 

Asn lie Gly Phe Thr Thr Pro Glu Glu Cys Gly Ala lie Phe Gin Tyr 

20 25 30 

Leu He Glu Asn Val Gin Thr Asp Arg Gin He He Tyr Ser Pro Pro 

35 40 45 

Cys His Asp Glu Leu Arg Met Ala Val Ala Asn Ser Leu Ala Ala Val 

50 55 60 

Lys Asn Gly Ala Gly Leu Phe Glu Glu Thr lie His Gly He Arg Glu 
65 70 75 80 

Arg Ala Glu Asn Ala Ala Leu Glu Glu He Ala Val Ala Leu Asn He 

85 90 . 95 

Arg Gin Asp Tyr Tyr Gin Val Glu Thr Ser He Val Leu Asn Glu Thr 

100 105 , HO 

lie Asn Thr Ser Glu Met Val Ser Arg Phe Ser Gly He Pro Val Pro 

115 120 125 

Lys Asn Lys Ala Val Val Gly Gly Asn Thr Phe Ser His Glu Ser Gly 

130 135 140 

He His Gin Asp Gly Val Leu Lys Asn Pro Leu Thr Tyr Glu He He 
145 150 155 160 

Thr Pro Glu Leu. Val Gly Val Lys He Pro Leu Gly Lys Leu Ser Gly 

165 170 175 

Arg His Ala Phe Val Glu Lys Leu Arg Glu Leu Ala Leu Asp Phe Thr 

180 185 190 

Glu Glu Asp He Lys Pro Phe Phe Ala Lys Phe Lys Ala Leu Ala Asp 
195 200 205 . 

Lys Lys 
210 

(2) INFORMATION FOR SEQ ID NO: 32 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION:. SEQ ID NO: 328: 

256 



W ° 97/37026 PCT/US97/05306 

Met ser Gin Gin Val Lys Asn Ala His Asn Leu Tyr He His Ala He 

1 5 10 15 

Gin Asp Gly Arg Val Ala Glu Ala Gin Ala Gin Ser Val Gly Asp Thr 



20 



25 



30 



Tyr lie Gin His Ser Thr Gly Val Pro Asp Gly Lys Glu Gly Phe Ala 
„ ^ 40 45 

Ala Phe Phe Ala Asp Phe Phe Glu Arg His Pro Glu Arg Gin He Lys 
50 55 60 



He val Arg Thr He Glu Asp Gly Asn Leu Val Phe Val His Val Pro 

70 75 80 

Ser lie Ser Glu Trp Trp Arg Ser Ser Met Gly Asp Asp Gly Tyr Phe 



Pro Cys Gly 



85 9Q 



95 



(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 122 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 

Met Glu Met Gin Thr Ser Ala Gin Leu Leu Thr Asn L y S He Phe Leu 

L ys Asn Pro Leu Lys Ala Leu Val Glu Glu Lys Tyr Gly He Glu Tyr 

20 25 
Glu Glu Phe Thr Asn Pro Trp His Ala Ala He Ser Ser Phe Val Ala 

Phe Phe Leu Arg Ser Leu Pro Pro Met Leu Ser Val 11 He Phe Pro 
o 50 55 6Q 

Ser Glu Tyr Arg He Pro Ala Thr Val Leu ^ ^ ^ ^ ^ 

Leu Leu Th r Gly Ty r Thr Ser Ala Arg Leu Gl'y Lys Asp Pro Thr Irg 

Thr Ala Met He Arg Asn Leu Ala He T ly Leu Leu Thr Met Gly Val 

100 105 110 

Thr Phe Leu Leu Glu Gin Leu Phe Ser He 

115 120 
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(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

Met Ala Phe Phe Leu Cys Pro His Phe Arg Ser Asn His Trp Lys lie 

15 10 15 

Leu Thr Val Ser Asp Thr Met Glu Glu Lys . Arg Leu Glu Tyr Pro Met 

20 25 30 

Val Thr Phe Leu Gly Asn Pro Val Ser Phe Thr Gly Lys Gin Leu Gin 

35 40 45 

Val Gly Asp Lys Ala Leu Asp Phe Ser Leu Thr Thr Thr Asp Leu Ser 

50 55 60 

Lys Lys Ser Leu Ala Asp Phe Asp Gly Lys Lys Lys Val Leu Ser Val 
65 70 75 80 

Val Pro Ser lie Asp Thr Gly He Cys Ser Thr Gin Thr Arg Arg Phe 

85 90 95 

Asn Glu Glu Leu Ala Gly Leu Asp Asn Thr Val Val Leu Thr Val Ser 

100 105 HO 

Met Asp Leu Pro Phe Ala Gin Lys Arg Trp Cys Gly Ala Glu Gly Leu 

115 120 125 

Asp Asn Ala He Met Leu Ser Asp Tyr Phe Asp His Ser Phe Gly Arg 

130 135 140 

Asp Tyr Ala Leu Leu He Asn Glu Trp His Leu Leu Ala Arg Ala Val 
145 150 155 160 

Phe Val Leu Asp Thr Asp Asn Thr He Arg Tyr Val Glu Tyr Val Asp 

165 170 175 

Asn He Asn Ser Glu Pro Asn Phe Glu Ala Ala He Ala Ala Ala Lys 
180 185 190 

Ala Leu 



(2) INFORMATION FOR SEQ ID NO: 3 31: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 48 amino acids 

258 



WO 97/37026 PCT/US97/05306 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : s i ng 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 
Met Gly Tyr Arg Pro Ser Thr Ala Asn Ala lie He His Gin Val Arg 
Glu Leu Leu Val Ser Arg Gly Tyr Thr Phe Tyr Asn Arg Lys Arg Lev 



20 25 



2U 

Met Val Val Pro Lys Ser Val Val Lys Glu Leu Leu Gly Met Glu Leu 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 
Met Gin Ala Val Glu His Phe He Lys Gin Phe Val Pro Glu His Tyr 



10 



15 



Asp Leu Phe Leu Asp Leu Ser Arg Glu Thr Lys Thr Phe Ser Gly Lys 

20 25 
Val Thr lie Thr Gly Gin Ala Gln Ser Asp Arg ^ ^ ^ ^ ^ 

35 40 45 

Lys Asp Leu Glu lie Thr Ser Val Glu Val Ala Gly Gin Ala Arg Pro 
50 55 ^ 

Phe Thr Val Asp His Asp Asn Glu Ala Leu His He Glu Leu Ala Glu 

70 75 
Ala Gly Gin Val Glu Leu Val Leu Ala Phe Ser Gly Lys Ile Thr " p 

Asn Met Thr Gly He Tyr Pro Ser Tyr Tyr Thr Val Asp Gly Val Lys 



110 



Lys Glu Val Leu Ser Thr Gin Phe Glu Ser His Phe Ala Arg Glu Ala 
115 120 125 
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Phe Pro Cys Val Asp Glu Pro Glu Ala Lys Ala Thr Phe Asp Leu Ser 

130 135 140 

Leu Arg Phe Asp Gin Ala Glu Gly Glu Leu Ala Leu Ser Asn Met Pro 
145 150 155 160 

Glu lie Asp Val Glu Asn Arg Lys Glu Thr Gly lie Trp Lys Phe Glu 

165 170 175 

Thr Thr Pro Arg Met Ser Ser Tyr Leu Leu Ala Phe Val Ala Gly Asp 

180 185 190 

Leu Gin Gly Val Thr Ala Lys Thr Lys Asn Gly Thr Leu Val Gly Cys 

195 200 205 

Leu Leu Asn Gin Ser Thr Ser Thr Phe Lys Ser 
210 215 



(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TO POLOG Y : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 3: 

Met Ser Gin Glu Phe Leu Ala Arg lie Leu Glu Gin Lys Ala Arg Glu 

1 5 10 15 

Val Glu Gin Met Lys Leu Glu Gin lie Gin Pro Leu Arg Gin Thr Tyr 

20 25 30 

Arg Leu Ala Glu Phe Leu Lys Asn His Gin Asp Arg Leu Gin Val lie 

35 40 45 

Ala Glu Val Lys Lys Ala Ser Pro Ser Phe Gly Glu lie Ser lie Ser 

50 55 60 

Met Trp He Leu Cys Asn Arg Pro Arg Leu Met Lys Lys Thr Glu Gin 
65 70 75 80 



(2) INFORMATION FOR SEQ ID NO: 33 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

Met Ser Ser Phe Thr Pro Glu Asp Leu Gly Met Glu Gly Tyr Ala Met 

] 5 10 15 

Glu Asp He Arg Gly Gly Asn Ala Gin Glu Asn Ala Glu He Leu Leu 



20 25 30 



Ser Val Leu Lys Asn Glu Ala Ser Pro Phe Leu Glu Thr Thr Val Leu 

35 40 45 

Asn Ala Gly Leu Gly Phe Tyr Ala Asn Gly Lys He Asp Ser He Lys 

50 55 60 

Glu Gly Val Ala Leu Ala Arg Gin Val He Ala Arg Gly Lys Ala Leu 
65 70 75 80 

Glu Lys Leu Arg Leu Leu Gin Glu Tyr Gin Lys 
85 9 0 

(2) INFORMATION FOR SEQ ID NO: 33 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear . 

(ii) ' MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 

Met lie Tyr Thr Val Thr Leu Asn Pro Ser He Asp Tyr He Val Arg 

15 10 15 

Leu Asp Gin Val Lys Val Gly Ser Val Asn Arg Met Asp Ser Asp Asp 

20 25 30 

Lys Phe Ala Gly Gly Lys Gly He Asn Val Ser Arg Val Leu Lys Arg 

35 40 45 

Leu Asn He Ser Asn Thr Ala Thr Gly Phe He Gly Gly Phe Thr Gly 



33 60 

Ly S Phe Xaa Xaa Asp Thr Leu Ala Glu Glu Glu He Glu Xaa Arg Phe 

70 7 5 80 

Val Gin val Ala Glu Asp Thr Arg He Asn Val Lys He Xaa Ala Asp 



85 90 95 



Gin Glu Thr Glu He Asn Gly Thr Gly Pro Thr Val Glu Pro Val Lys 
100 105 110 
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Leu Glu Glu Leu Lys Ala lie Leu Ser Ser Leu Thr Ala Glu Asp Thr 

. 115 . 120 125 

Val Val Phe Ala Gly Ser Ser Ala Lys Asn Leu Gly Asn Val lie Tyr 

130 135 140 

Lys Gly Phe Asn Leu Leu Asp Ala Pro Asp Trp Cys Ala Ser Gly Leu 

145 150 155 160 

(2) INFORMATION FOR SEQ ID NO: 3 36: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 133 amino acids 
(B> TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 36: 

Met Met Gin Val Tyr Phe Asp Gin Gly lie Tyr Asn Lys Lys Ala Val 

15 10 15 

Phe Glu Val Tyr Phe Arg Gin Gin Pro Phe Lys Asn Gly Tyr Ala Val 

20 25 30 

Phe Ala Gly Leu Glu Arg lie Val Asn Tyr Leu Glu Asp Leu Arg Phe 

35 40 45 

Ser Asp Ser Asp lie Ala Tyr Leu Glu Ser Leu Gly Tyr His Gly Ala 

50 . 55 60 

Phe Leu Asp Tyr Leu Arg Asn Phe Lys Leu Glu Leu Thr Val Arg Ser 
65 70 75 80 

Ala Gin Glu Gly Asp Leu Val Phe Ala Asn Glu Pro lie Val Gin Val 

85 90 95 

Glu Gly Pro Leu Ala Gin Cys Gin Leu Val Glu Thr Ala Leu Leu Asn 

100 105 110 

lie Val Asn Tyr Gin Thr Leu Val Ala Thr Lys Ala Ala Pro Tyr Pro 

115 120 125 

Phe Gly Tyr Arg Lys 
130 

(2) INFORMATION FOR SEQ ID NO: 33 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 37: 

Met Lys lie He He Gin Arg Val Lys Lys Ala Gin Val Ser He Glu 

5 10 15 

Gly Gin He Gin Gly Lys He Asn Gin nix, t^, t 

x yi > lie Asn Gin Gly Leu Leu Leu Leu Val Gly 

20 25 
Val Gly Pro Glu Asp Gin Glu Glu Asp Leu Asp Tyr Ala Val Arg Lys 

35 40 45 

Leu val Asn Met Arg He Phe Ser Asp Val Glu Gly Lys Met Asn Leu 

50 55 60 

Ser Val Lys Asp He Glu Gly Glu lie Leu Ser He Leu Ser Leu Pro 

70 75 80 

Leu Cys Gly Tyr 

(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid . 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 

Met lie Lys Gly Thr He L ys Thr Val Ser He Met Ala Ala Thr Ser 

5 10 15 

Gly Phe Leu Leu Tyr Asn Glu Val Phe Phe Leu Thr Asn Gly Ala Ala 

2 0 25 

Gly Thr Lys Ser lie Ser Phe Val He Arg Glu Leu Ala Val Ala Ser 

Ser Arg Thr Gin Tyr Ala Arg Ala Asn Thr He Gly Val He Gin He 

55 

^ MSt LSU " e Ile V*l Cys He Asn He Leu Phe Arg Glu 

70 ?5 

Arg Lys Arg Leu Lys Gly Gly Lys 

85 



80 
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(2) INFORMATION FOR SEQ ID NO: 33 9: 
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(i) .SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
.(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 



Met Gly Phe Gly lie Pro Ala Ala lie Gly Ala Lys lie Ala Asn Pro 

15 10 15 

Asp Lys Glu Val Val Leu Phe Val Gly Asp Gly Gly Phe Gin Met Thr 

20 25 30 

Asn Gin Glu Leu Ala lie Leu Asn lie Tyr Lys Val Pro lie Lys Val 

35 40 45 

Val Met Leu Asn Asn His Ser Leu Gly Met Val Arg Gin Trp Gin Glu 

50 55 60 

Ser Phe Tyr Glu Gly Arg Thr Ser Glu Ser Val Phe Asp Thr Leu Pro 
65 70 75 80 

Asp Phe Gin Leu Met Ala Gin Ala Tyr Gly lie Lys Asn Tyr Lys Phe 

85 90 95 

Asp Asn Pro Glu Thr Leu Ala Gin Asp Leu Glu Ala Thr Thr Glu Asp 

100 105 110 

Val Pro Met Leu lie Glu Val Asp lie Ser Arg Lys Glu Gin Val Leu 

115 120 125 

Pro Met Val Pro Ala Gly Lys Ser Asn His Glu Met Leu Gly Val Lys 

130 135 140 

Phe His Ala 
145 



(2) INFORMATION FOR SEQ ID NO: 3 40: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 
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Met Trp Leu Pro Leu Asp Arg Asn Asn Lys Gly Gin Asn He Ala Gly 

5 10 15 

Ala Arg Gin Ala Ala Glu Gly H e Phe Gly Val A Sp Ala Ser Gin Leu 



25 



30 



Thr val Pro Gin Ala Ala Phe Leu Ala Gly Leu Pro Gin Ser Pro He 

35 40 45 

Thr Tyr Ser Pro Tyr Glu Asn Thr Gly Glu Leu Lys Asn 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 341: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 

Met Tyr Leu Gly Asp Leu Met Glu Lys Ala Glu Cys Gly Gln Phe Ser 

lie Leu ser Phe Leu Leu Gin Glu Ser Gin Thr Thr Val Lys aL Val 

-20. 25 
Met Glu Glu Thr Gly Phe Ser Lys Ala Thr Leu Thr Lys ^ val Thr 

r t 40 45 

^ ASP Ala LeU Asp Ser Gly Leu Glu Leu Thr He His 

5 5 

Ser Glu Asp Glu Asn Leu Arg Leu Ser He Gly Ala Ala Thr Lys Gly 
Arg Asp He Pro Glu 11 Val Phe Trp lie Val Leu Leu Asn Thr T rg 



85 90 



Phe Trp Phe He Phe Ser Thr Thr Asn Ser Phe 



95 



100 105 



(2) INFORMATION FOR SEQ ID NO: 342 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 129 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

Met Lys Glu lie lie Glu Lys Leu Ala Lys Phe Glu Asn Leu Ser Gly 

15 10 15 

Val Glu Met Thr Asp Val lie Glu Arg lie Val Thr Gly Arg Val Thr 

20 25 30 

Glu Ala Gin lie Ala Ser Leu Leu Leu Ala Leu Lys Met Lys Gly Glu 

35 40 45 

Thr Pro Glu Glu Arg Thr Ala lie Ala Gin Val Met Arg Gly His Ala 

50 55 60 

Gin His lie Pro Thr Glu lie His Asp Ala Met Asp Asn Cys Gly Thr 
65 70 75 80 

Gly Gly Asp Lys Ser Phe Ser Phe Asn lie Ser Thr Thr Ala Ala Phe 

85 90 95 

Val Leu Ala Gly Gly Gly lie His Met Ala Lys His Gly Asn Arg Ser 

100 105 110 

lie Ser Ser Lys Ser Gly Ser Ala Asp Ser Xaa Asn Leu Gly Asn Gin 
115 120 125 

Ser 



(2). INFORMATION FOR SEQ ID NO: 343: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 9 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 



Met Asn Val Gin Met Ser Ser Lys 

1 5 
Glu Met Gin Asn lie Gin Arg Arg 
20 

Gin Arg Tyr Arg Ser Gin Asp Leu 

35 40 
Asp Asn Leu Glu Arg Ala Leu Ala 
50 55 



Thr Asn lie Leu Arg Ala His Ala 

10 15 
Ala Asn Glu Glu Arg Gin Asn Leu 
25 30 
Ala Lys Ala lie Leu Pro Ser Leu 
45 

Val Glu Gly Leu Thr Asp Asp Val 
60 
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Lys Lys .Gly Leu Gly Met Val Gin Glu Ser Leu He His Ala Leu Lys 



65 70 75 



80 



Glu Glu Gly He Glu Glu He Ala Ala Asp Gly Glu Phe Asp His Asn 

85 90 „ 

Tyr His Met Ala He Gin Thr Leu Pro Ala Asp Asp Asp His Pro Val 

100 105 11Q 

Asp Thr He Ala Gin Val Phe Gin Lys Gly Tyr Lys Leu His Asp Arg 

115 120 125 

He Leu Arg Pro Ala Met Val Val Val Tyr Asn 
130 135 

(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

Met Ser Asn Lys Ser Tyr Ser Lys Phe Gin Glu Glu Val Ser Leu Lys 

1 5 10 15 

Tyr Gly Phe He Gly Leu Lys Leu Asp Lys Leu Ser Leu Thr Ala Glu 

20 25 30 

Val Ser Glu Glu Phe His Ser Glu He Leu Ser Gly Asn Phe Thr Leu 



35 40 



Tyr Asp He Ser Val 
50 



(2) INFORMATION FOR SEQ ID NO: 3 45: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 
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Met His Phe Asp Lys Ser Lys Phe 

1 5 
Leu Tyr Glu Val Glu Val lie Asn 
20 

Gin Tyr Glu Val lie Gin Ser Arg 

35 40 
lie Glu Met Ala Lys He Lys 
50 55 
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Gly Ala Val Phe Ser Ala Pro Gly 

10 15 
Asn Ala Ser Phe Gly Gin Asn Ala 
25 30 
Lys Leu Gly Thr Phe Ala Glu Leu 
45 



(2) INFORMATION FOR SEQ ID NO:346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 



Met lie Gin Ala Val Phe Glu Arg Ala Glu Asp Gly Glu Leu Arg Ser 

15 10 15 

Ala Glu lie Thr Gly His Ala Glu Ser Gly Glu Tyr Gly Leu Asp Val 

20 25 30 

Val Cys Ala Ser Val Ser Thr Leu Ala He Asn Phe He Asn Ser He 

35 40 45 

Glu Lys Phe Ala Gly Tyr Glu Pro He Leu Glu Leu Asn Glu Asp Glu 

50 55 60 

Gly Gly Tyr Leu Met Val Glu He Pro Lys Asp Leu Pro Ser His Gin 
65 70 . 75 80 

Arg Glu Met Thr Gin Leu Phe Phe Glu Ser Phe Phe Leu Gly Met Ala 

85 90 95 

Asn Leu Ser Glu Asn Ser Ser Glu Phe Val Gin Thr Arg Val He Thr 
100 105 HO 

Glu Asn 



(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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. (ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 

Met Thr Ala lie Ser Met Lys Trp Leu Asn Asp Tyr lie Trp Pro Ala 

15 10 15 

Glu Ser Glu Phe Thr Pro Asp Met Thr Thr Asn Ala Val Lys Glu Ala 

20 25 30 

Leu Thr Glu Met Leu Gin Ser Gly Thr Thr Thr Phe Asn Asp Met Tyr 

35 40 45 

Asn Pro Asn Gly Val Asp lie Gin Gin lie Tyr Gin Val Val Lys Thr 

50 55 60 

Ser Lys Met Arg Cys Tyr Phe Ser Pro Thr Leu Phe Ser Ser Glu Thr 
65 70 75 80 

Glu Thr Thr Ala Glu Thr He Ser Arg Thr Arg Ser He He Asp Glu 

85 90 95 

lie. Leu Lys Tyr Lys Asn Pro Lys Phe Gin Gly Leu Trp 
100 105 



(2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 

<C ) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 

Met Val Arg Glu Ser Ala Glu Ser Ala Gly Phe Phe Leu Glu Thr His 

15 10 15 

Met Val Gin Gly Glu Trp Asn Thr Cys Val Phe Lys Lys Thr Lys Asp 

20 25 30 

He Ser Gly Val lie Gly Gly 
35 

(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 49 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 
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<ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349 : 

Met Thr Val Asp Arg Glu Gly Phe Glu Ala Ala Met Lys Glu Gin Gin 

15 10 15 

Glu Arg Ala Arg Ala Ser Ala Val Lys Gly Gly Ser Met Gly Met Gin 

20 25 30 

Asn Gly Asn Ser Ser Lys His His Cys Arg Lys Cys Leu Gin Leu Gin 
35 40 45 

Cys 



(2) INFORMATION FOR SEQ ID NO: 3 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 0: 



Met Lys Ala Gin 
1 

Asn Thr Val Val 
20 

Tyr Asp Leu Asp 

35 

Arg Val Val Leu 
50 

Glu Val Ala Leu 
65 

Phe Pro Tyr lie 

Lys Val Glu Asn 
100 



Leu Val Pro Ser 
5 

Leu Phe Phe Gly 

Leu Val Arg Glu 
40 

Leu Ser Asp Gin 
55 

Gly Cys Gly Gly 
70 

Val Tyr Ala Gin 
85 

Lys Pro Asp Thr 



Arg Ser Lys Tyr 
10 

Thr Arg Thr Asp 
25 

Val Ala Gly Asp 

Ala Phe Gly Leu 
60 

Val Leu Asn Asp 
75 

Leu Phe Ala Leu 
90 

Pro Ser Pro Thr 
105 



Leu lie Asn Asp 
15 

Tyr Thr Arg Lys 
30 

Gin lie Ala Arg 
45 

Glu Asn Val Lys 

lie Tyr Arg Val 
80 

Leu Thr Ser Leu 
95 

Gly Thr Val Asn 
110 
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Arg Val Val Gin Gly Val lie lie His Glu Tyr Gin Lys 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 3 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 6 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 



Met Ala Val Asn Asn Glu He Gly Ser He Gin Xaa He Glu Ala He 

15 10 15 

Ser Glu Phe Leu Ala Asp Lys Pro Thr He Ser Phe His Val Asp Ala 

20 25 30 

Val Gin Ala Leu Ala Lys He Pro Thr Glu Lys Tyr Leu Thr Glu Arg 

35 40 45 

Val Asp Cys Ala Thr Phe Ser Ser His Lys Phe His Gly Val Arg Gly 

50 55 60 

Val Gly Phe Val Tyr He Lys Ser Gly Lys Lys He Thr Pro Leu Leu 
65 70 75 so 

Thr Gly Gly Gly Gin Glu Arg Asp Tyr Arg Ser Thr Thr Glu Asn Val 

85 90 95 

Ala Gly He Ala Ala Thr Ala Lys Ala Leu Arg Leu Ser Met Glu Lys 

100 105 no 

Leu Asp He Phe Arg Ser Lys Thr Gly Gin Met Lys Ala Val He His 

115 120 125 

Gin Ala Leu Leu Asn Tyr Pro Asp He Phe Val Phe Ser Asp Glu Glu 

13 0 135 140 

Asp Phe Ala Pro His He Leu Thr Phe Gly He Lys Gly Val Arg Gly 
145 "0 155 i 6 o 

Glu Val He Val His Ala Phe Glu Asp Tyr Asp He Phe He Ser Thr 

165 170 175 

Thr Ser Ala Cys Ser Ser Lys Ala Gly Lys Pro Ala Gly Thr Leu lie 

180 185 iso 

Ala Met Gly Val Asp Lys Asp Lys Ala Lys Ser Ala Val Arg Leu Ser 

195 200 205 

Leu Asp Leu Glu Asn Asp Met Ser Gin Val Glu Gin Phe Leu Thr Lys 
21° 215 220 
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(2) INFORMATION FOR SEQ ID NO: 3 52: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 166 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 

Met Leu Val Thr Gly Ala He Leu Gly Val Asn Val His lie Phe Ser 

1 5 10 15 

Pro Lys Glu Leu Phe Pro Glu Lys Glu He Val Glu Leu Ala Glu Gly 

20 25 30 

Phe Ala Lys Glu Ser Gly Ala His Val Leu lie Thr Glu Asp Ala Asp 

35 40 45 

Glu Ala Val Lys Asp Ala Asp Val Leu Tyr Thr Asp Val Trp Val Ser 

50 55 60 

Met Gly Glu Glu Asp Lys Phe Ala Glu Arg Val Ala Leu Leu Lys Pro 
65 70 75 80 

Tyr Gin Val Asn Met Asp Leu Val Lys Lys Ala Gly Asn Glu Asn Leu 

85 90 95 

He Phe Leu His Cys Leu Pro Ala Phe His Asp Thr His Thr Val Tyr 

100 105 110 

Gly Lys Asp Val Ala Glu Lys Phe Gly Val Glu Glu Met Glu Val Thr 

115 120 125 

Asp Glu Val Phe Arg Ser Lys Tyr Ala Arg His Phe Asp Gin Ala Glu 

130 135 140 

Asn Arg Met His Thr He Lys Ala Val Met Ala Ala Thr Leu Gly Asn 
145 150 155 160 

Leu Tyr He Pro Lys Val 
165 

(2) INFORMATION FOR SEQ ID NO: 353: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single. 
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(D) TOPOLOGY: linear 



PCT/US97/05306 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:353: 

Met Arg Leu Glu Gin Asp Cys Pro Val Phe Leu Lys lie Lys Glu Lys 

15 10 15 

Asp Met Ala Ser Lys Met Leu His Thr Cys Leu Arg Val Glu Asn Leu 

20 25 30 

Glu Lys Ser lie Ala Phe Tyr Gin Asp Ala Phe Gly Phe Lys Glu Leu 

35 40 45 

Arg Arg Arg Asp Phe Pro Asp His Ala Phe Thr lie Val Tyr Leu Gly 

50 55 60 

Leu Glu Gly Asp Asp Tyr Glu Leu Glu Leu Thr Tyr Asn Tyr Asp His 
65 70 75 80 

Gly Pro Tyr Val Val Gly Asp Gly Phe Ala His He Ala Leu Ser Thr 

85 90 95 

Pro Asp Leu Glu Ala Leu His Gin Glu His Ser Thr Lys Gly Tyr Glu 

100 105 110 

Val Thr Glu Pro Asn Gly Leu Pro Gly Thr Ala Pro Asn Tyr Tyr Phe 

115 120 125 

Val Lys Asp Pro Asp Gly Tyr Lys Val Glu Val He Arg Glu Lys 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 3 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 54: 

Met Asp His Thr lie Phe Cys Val Gly Tyr Arg Val Met Gin Lys Asp 

15 10 15 

Leu Glu Gly Thr. Leu Asp Ala Glu Lys Leu Lys Ala Ala Gly Val Pro 

20 25 30 

Phe Gly Pro Leu Phe Gly Lys lie Lys Asn Gly Gin Asp Leu Val Leu 
35 40 .45 
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Glu Asp Gly Thr Glu lie Lys Ala Ala Asp Tyr lie Ser Ala Pro Arg 

50 55 60 

Pro Gly Lys He He Thr He Leu Gly Asp Thr Arg Lys Thr Asp Ala 
65 70 75 go 

Ser Val Arg Leu Ala Val Asn Ala Asp Val Leu Val His Glu Ser Thr 

85 90 95 

Tyr Gly Lys Gly Asp Glu Lys lie Ala Arg Asn His Gly His Ser Thr 

100 105 no 

Asn Met Gin Ala Ala Gin Val Ala Val Glu Ala Gly Ala Lys Arg Leu 

120 125 
Leu Leu Asn His He Ser Ala Arg Phe Leu Ser Lys Asp He Ser Lys 

130 135 140 

Leu Lys Lys Asp Ala Ala Thr He Phe Glu Asn Val His Val Val Lys 
145 150 155 160 

Asp Leu Glu Lys Met Glu He Tyr Gin Ser Gin Lys Gly 
165 170 

(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

{ C } STRANDEDNESS : s ingl e 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 55: 



Met He Gin Pro Ala Ser Leu Glu 

1 5 
Ala Gly Lys Lys Val Phe Leu Phe 
20 

Arg Tyr He Tyr Pro Ala Leu Pro 

35 40 
Phe Thr Phe He Arg Met Asp Arg 

50 55 
Leu Trp Asp Val Tyr Gly He Pro 
65 70 
Lys Glu He Gly Arg Phe Val Asn 
85 

He Asn Asp Phe Leu Ala Gly Leu 
100 



Glu Leu Ala Ser Leu Val Glu Lys 

10 15 
Val Ala Asp Trp Cys Gly Asp Cys 
25 30 
Glu lie Glu Glu Thr Asn Pro Glu 
45 

Asp Gin Tyr Met Asp Leu Ala Lys 
60 

Ser Leu Val Val Leu Glu Lys Asp 

75 80 
Arg Asp Arg Lys Ser Lys Glu Gin 
90 95 

Lys 
105 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 

Met Arg Leu Gly Leu Leu Gin Val Leu Arg Leu Pro Lys Ala Phe Gin 

15 10 15 

Leu lie Phe lie Gin Asp Lys Gly His Gly Asp Val Ser Ser Phe Thr 

20 25 30 

Ala Ser Cys Val Thr Gly Pro Trp Ala Phe Lys Leu Val Gly Lys Gin 

35 40 45 

Gly Asn lie His 
50 

(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 

Met Lys Thr Arg Lys He Pro Leu Arg Lys Ser Val Val Ser Asn Glu 

15 10 15 

Val He Asp Lys Arg Asp Leu Leu Arg He Val Lys Asn Lys Glu Gly 

20 25 30 

Gin Val Phe He Asp Pro Thr Gly Lys Ala Asn Gly Arg Gly Ala Tyr 

35 40 45 

He Lys Leu Asp Asn Ala Glu Ala Leu Glu Ala Lys Lys Lys Lys Val 

50 55 60 

Phe Asn Arg Ser Phe Ser Met Glu Val Glu Glu Ser Phe Tyr Asp Glu 
65 70 75 80 
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Leu He Ala Tyr Val Asp His Lys Val Lys Arg Arg Glu Leu Gly Leu 
85 90 95 

Glu 



(2) INFORMATION FOR SEQ ID NO: 3 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 

Met Arg Val Ala Tyr Phe Lys Val His His Pro He Tyr Tyr Tyr Cys 

1 5 10 is 

Ala Tyr Phe Ser He Arg Ala Lys Ala Phe Asp He Lys Thr Met Gly 

20 25 30 

Ala Gly Leu Glu Val He Lys Arg Arg Met Glu Glu He Ser Glu Lys 

35 40 45 

Arg Lys Asn Asn Glu Ala Ser Asn Val Glu He Asp Leu Tyr Thr Thr 

50 55 60 

Leu Glu He Val Asn Glu Met Trp Glu Arg Gly Phe Lys Phe Gly Lys 
65 70 75 80 

Leu Asp Leu Tyr Cys Ser Gin Thr Thr Glu Phe Leu He Asp Gly Asp 

85 90 95 

Thr Leu He Pro Pro Phe Val Ala Met Asp Gly Leu Gly Glu Asn Val 

100 105 HO 

Ala Lys Gin Leu Val Arg Ala Arg Glu Glu Gly Glu Phe Leu Ser Lys 

115 120 125 

Thr Glu Leu Arg Lys Arg Gly Gly Leu Ser Ser Thr Leu Val Glu Lys 

130 135 140 

Met Asp Glu Met Gly He Leu Gly Asn Met Pro Glu He Thr Ser 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 35 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 59: 

Met Ser Ser Lys lie Ser He Gly Gin Leu He Thr Phe Asn Thr Leu 

15 10 15 

Leu Ser Tyr Phe Thr Thr Pro Met Glu Asn He He Asn Leu Gin Thr 

. 20 25 30 

Lys Leu Gin Ser Ala Lys Val Ala Asn Asn Arg Leu Asn Glu Val Tyr 

35 40 45 

Leu Val Glu Ser Glu Phe Gin Val Gin Glu Asn Pro Val His Ser His 

50 55 60 

Phe Leu Met Gly Asp He Glu Phe Asp Asp Leu Ser Tyr Lys Tyr Gly 
65 70 75 80 

Phe Gly Arg Asp Thr Leu Thr Asp He Asn Leu Thr He Lys Gin Gly 

85 90 95 

Asp. Lys Val Ser Leu Val Gly Val Ser Gly Ser Gly Lys Thr Thr Leu 

100 105 HO 

Ala Lys Met lie Val Asn Phe Phe Glu Pro Tyr Lys Gly His lie Ser 

115 120 125 

He Asn His Gin Asp He Lys Asn He Asp Lys Lys Ser Leu Ala Pro 

130 135 140 

Ser Tyr 
145 



(2) INFORMATION FOR SEQ ID NO: 3 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 

Met Arg Gin Ala Pro Ala Leu Ala Gin Ala Asn He Glu Arg Val Val 

15 10 15 

Val His Lys He Ser Lys Val Trp Glu Phe His Phe Arg He Phe 
20 25 30 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 61: 

Met Gin Thr Gly Trp Gin Tyr Leu Gly Asn Lys Trp Tyr Tyr Leu Arg 

1 5 io 15 

Ser Ser Gly Ala Met Ala Thr Gly Trp Tyr Gin Glu Gly Thr Thr Trp 

20 25 30 

Tyr Tyr Leu Asp Gin Pro Asn Gly Asp Met Lys Thr Gly Trp Gin Asn 

35 40 45 

Leu Gly Asn Lys Trp Tyr Tyr Leu Arg Ser Ser Gly Ala Met Ala Thr 

50 55 60 

Gly Trp Tyr Gin Glu Gly Thr Thr Trp Tyr Tyr Leu Asp Gin Pro Asn. 
65 70 75 80 

Gly Asp Met Lys Thr Gly Trp Gin Asn Leu Gly Asn Lys Trp Tyr Tyr 

85 90 95 

Leu Arg Ser Ser Gly Ala Met Ala Thr Gly Trp Tyr Gin Asp Gly Ser 

100 105 no 

Thr Trp Tyr Tyr Leu Asn Ala Val Met Glu He 
115 120 

(2) INFORMATION FOR SEQ ID NO: 3 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 



Met Lys Trp Gly Ser Arg He Leu Leu Gly Leu Thr Pro Lys Ser Leu 
1 5 10 15 
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Arg Tyr Arg He Trp Lys Lys Ala 
20 

Leu Ala Asp Cys Asp Gly He Thr 

35 ' 40 

Met Arg Asn Lys Tyr Pro He Thr 

50 55 
Pro Phe Glu Gly Thr Glu Met Pro 
65 70 
Leu Arg Thr Ala Phe Gly Asp Tyr 
85 

Gin Val Pro His Gin Asp Ala Val 
100 

Thr Glu Tyr Lys Gly Glu Tyr Gly 
115 120 
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Glu Lys Glu Met Thr Lys Tyr Asp 
25 30 
Glu Leu Cys Ser Gly Pro Gly Tyr 
45 

Ser Phe Glu Asp Asn Leu Phe Leu 
60 

He Pro He Gly Tyr Asp Val Tyr 

75 80 
Met Thr Pro Pro Pro Ala Asp Lys 

90 95 
He Ala Asp Met Asp Lys Ser Tyr 
105 no 
Gly 



(2) INFORMATION FOR SEQ ID NO: 3 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



{iij MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:363: 

Met Phe Pro Asp Ser He Asp Thr Met His Gin Ala Asn Glu Phe lie 

1-5 10 15 

Ala Leu Asp Asp Leu Phe Arg Ala Ala Ala He Tyr Ala Glu Ala lie 

20 25 30 

Tyr Glu Leu He Lys 
35 



(2) INFORMATION FOR SEQ ID NO: 3 64: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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Met Asp Glu Leu Gin Lys Arg Asn Leu Leu Gly Phe Val Phe Gin Asp 

1 . 5 10 15 

Phe Gin Leu Phe Pro His Leu Ser Val Leu Glu Asn Leu Thr Leu Ser 

20 25 30 

Pro Val Lys Thr Met Gly Met Lys Gin Glu Glu Ala Glu Lys Lys Ala 

35 40 45 

Ser Gly Leu Leu Glu Gin Leu Gly Leu Gly Gly His Ala Glu Ser Tyr 

50 55 60 

Pro Phe Ser Leu Ser Gly Gly Gin Lys Gin Arg Val Ala Leu Ala Arg 
65 70 75 80 

Ala Met Met lie Asp Pro Glu He He Gly Tyr Asp Glu Pro Thr Ser 

85 90 95 

Ala Leu Asp Pro Glu Leu Arg Leu Glu Val Glu Lys Leu He Leu Gin 

100 105 no 

Asn Arg Glu Leu Gly Met Thr His He Val Val Thr His Asp Leu Gin 
115 120 125 

Phe Gly 
130 

(2) INFORMATION FOR SEQ ID NO: 3 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 65: 

Met Tyr Gly Ala Asn Ser Glu Arg Leu Lys Glu Ser Leu Arg He Ser 

1 5 io 15 

Leu Ser Pro Gin Asn Thr Val Glu Asp Leu Gin Thr Leu Ala Lys Thr 

20 25 30 

Leu Lys Glu lie He Gly Gly 
35 

(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 41 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 6: 

Met lie Ala Arg Ser Tyr Gin Glu Met Asp Asp Leu Asp Thr Ala Tyr 

1.5 10 15 

Glu His Tyr Gin Glu Leu Thr Gly Asp Leu Lys Asp Asn Pro Glu Phe 

20 25 30 

Leu Glu His Tyr lie Tyr Leu Leu Arg 
35 40 

(2> INFORMATION FOR SEQ ID NO: 3 67: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 67: 



Met Lys His Phe 
1 

Met Ala Thr He 

20 

Glu Lys Asn Arg 
35 

Arg Cys Asn Val 
50 

He Pro Gly Asn 
65 

Asn His Asp He 

Val Glu Asp His 
100 

He He Glu Ala 
115 



Asp Thr He Val 
5 

Ser Ser Ser Phe 

Lys Leu Gly Lys 
40 

Thr Asn Asn Gly 
55 

Gly Arg Phe Leu 
70 

He Asn Phe Phe 
85 

Gly Arg Val Phe 

Leu Glu Lys Lys 
120 



He Gly Gly Gly 
10 

Tyr Gly Gin Lys 
25 

Lys Leu Ala Gly 

Ser Leu Asp Asn 
60 

Tyr Ser Val Phe 
75 

Thr Glu Asn Gly 
90 

Pro Ala Ser Asp 
105 

He Thr Glu Leu 



Pro Ala Gly Met 
15 

Thr Leu Leu He 
30 

Thr Gly Gly Gly 
45 

Leu Leu Ala Gly 

Ser Gin Phe Asp 
80 

Val Lys Leu Lys 
95 

Lys Ser Arg Thr 
110 

Gly Gly Gin Val 
125 
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(2) INFORMATION FOR SEQ ID NO: 3 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 



Met Thr Leu Ala 
1 

Val Tyr Leu Lys 
20 

Ser Pro lie Tyr 
35 

Arg Thr Leu lie 
50 

Pro Glu Val Glu 
65 

Gly Ala lie lie 



Lys Asp lie Ala 
5 

Pro Glu Glu Pro 

Thr Asp Asn Arg 
40 

Glu Asn Gly Phe 
55 

Val He Ala Gly 
70 

Ala Asp Lys Met 
85 



Ser His Leu Leu 
10 

Phe Thr Trp Ala 
25 

Val Thr Leu Ala 

Val Glu Ala He 
60 

Thr Ala Thr Ala 
75 

Asp Leu Pro Phe 

90 



Lys He Gin Ala 
15 

Ser Gly He Lys 
30 

Tyr Pro Glu Thr 
45 

Lys Glu Ala Phe 

Gly He Pro His 
80 

Ala Tyr He Pro 
95 



(2) INFORMATION FOR SEQ ID NO: 3 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369: 

Met Val Lys Val Leu Ala Ala Cys Gly Asn Gly Met Gly Ser Ser Met 

1 5 10 15 

Val He Lys Met Lys Val Glu Asn Ala Leu Arg Lys Leu Asn Gin Thr 

20 25 30 
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Asp Phe Thr Val Asn Ser Cys Ser Val Gly Glu Ala Lys Gly Leu Ala 

35 40 45 

Val Gly Tyr Asp lie Val lie Ala Ser Leu His Leu lie Gin Glu Leu 

50 55 60 

Glu Gly Arg Thr Asn Gly Lys Leu lie Gly Leu Asp Asn Leu Met Asp 
65 70 75 80 

Asp Lys Glu He Thr Glu Lys Leu Ser Gin Ala He Gin 
85 90 

(2) INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 370 : 

Met Ser Met Phe Leu Asp Thr Ala Lys He Lys Val Lys Ala Gly Asn 

1 5 10 15 

Gly Gly Asp Gly Met Val Ala Phe Arg Arg Glu Lys Tyr Val Pro Asn 

20 25 30 

Gly Gly Pro Trp Gly Gly Asp Gly Gly Arg Gly Gly Asn Val Val Phe 

35 40 45 

Val Val Asp Glu Gly Leu Arg Thr Leu Met Asp Phe Arg Tyr Asn Arg 

50 55 60 

His Phe Lys Ala Asp Ser Gly Glu Lys Gly Met Thr Lys Gly Met His 
65 ™. 75 so 

Gly Arg Gly Ala Glu Asp Leu Arg Val Arg Val Pro Gin Gly Thr Thr 

85 90 95 

Val Arg Asp Ala Glu Thr Gly Lys Val Leu Thr Asp Leu lie Glu His 

100 1Q5 110 

Gly Gin Glu Phe He Val Ala His Gly Gly Pro Trp Trp Thr Trp Lys 

115 120 125 

Tyr Ser Phe Arg Asp Thr Lys Lys Ser Cys Thr Gly Asn Leu 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 3 71: 

(i) SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 210 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear. 



(ii) MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 



Met Met Lys Gly Glu Met Thr Phe Lys Gin Val His Tyr Lys Tyr Gly 

1.5 10 15 

Tyr Gly Arg Asp Val Leu Ser Asp He Asn Leu Thr Val Pro Gin Gly 

20 25 30 

Ser Lys Val Ala Phe Val Gly He Ser Gly Ser Gly Lys Thr Thr Leu 

35 40 45 

Ala Lys Met Met Val Asn Phe Tyr Asp Pro Ser Gin Gly Glu He Ser 

50 55 60 

Leu Gly Gly Val Asn Leu Asn Gin He Asp Lys Lys Ala Leu Arg Gin 
65 70 75 80 

Tyr He lie Tyr Leu Pro Pro Gin Pro Tyr Val Phe Asn Gly Thr He 

85 90 95 

Leu Glu Asn Leu Leu Leu Gly Ala Lys Gly Gly Asp Asp Thr Gly Lys 

100 105 HO 

He Ser Leu Thr Gly Arg Ser Glu Phe Gly Ser Glu He Pro Lys Arg 

115 120 125 

Asn He Ser Lys Pro Pro Cys His Leu Glu lie Tyr Gin Thr Glu Leu 

130 135 140 

Thr Ser Asp Gly Ala Gly He Ser Gly Gly Gin Arg Gin Arg He Ala 
145 150 155 160 

Leu Ala Arg Ala Leu Leu Thr Asp Ala Pro Val Leu lie Leu Asp Glu 

165 170 175 

Ala Thr Asn Ser Leu Asp He Leu Thr Lys Lys Arg He Val His Tyr 

180 185 190 

Leu Met Ala Leu Asp Lys Thr Phe Asp Phe His Cys Ser Pro Leu Asp 
.195 200 205 

Tyr Cys 
210 

(2) INFORMATION FOR SEQ ID NO: 372: 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 73 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 72: 



Met Ala Arg Phe lie Glu lie Asp 

1 5 
Ser Gin lie Ser His Phe Pro His 
20 

Gin Thr Ala Val Tyr Ala Gin Glu 

35 40 
Gly Arg Trp Phe Ser Arg Tyr Asp 

50 55 
Asn Val Asp Leu His Ser Leu Val 
65 70 



Ala Lys Glu His Asp Arg Val Thr 

10 15 
lie Leu Ala Ser Ser Leu Met Glu 
25 30 
His Glu Asn Gly Lys Ala Leu Cys 
45 

Pro Asn Cys Gly Lys Arg Ala Arg 
60 

Gin 



(2) INFORMATION FOR SEQ ID NO: 373: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 



Met Ala His Gin Gly Gin 

1 5 
Lys Glu Asp Asn Leu Glu 
20 

lie Asn Leu His His Asn 
35 

Glu Asp Leu Cys Ala Ser 
50 

Ala Lys Thr Lys Lys Ala 
65 70 
Val Ala Gly Gly Val Ala 
85 

Ala Glu Val Thr Asp Val 
100 

Gly Asp Asn Ala Gly Met 
115 



Asp lie Tyr Asp Phe 

10 

Phe Ser Phe Ser Gly 
25 

Ala Glu Gin Lys Gly 
40 

Phe Gin Ala Ala Val 
55 

Leu Glu Lys Tyr Pro 
75 

Ala Asn Lys Gly Leu 
90 

Lys Val lie lie Pro 
105 

He Ala Tyr Ala Ser 
120 
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Pro Arg Ala Met He 
15 

Leu Lys Ser Ala Phe 
30 

Glu Ser Leu Ser Thr 
45 

Leu Asp He Leu Met 
60 

Val Lys Thr Leu Val 
80 

Arg Glu Arg Leu Ala 
95 

Pro Leu Arg Leu Cys 
110 

Val Ser Glu Trp Asn 
125 
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Lys Glu Asn Phe Ala Asn Leu Asp 

130 135 
Phe Asp Thr Met Glu 
145 
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Leu Asn Ala Lys Pro Ser Leu Ala 
140 



(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 

Met Lys Arg Ser Leu Asp Ser Arg Val Asp Tyr Ser Leu Leu Leu Pro 

15 10 15 

Val Phe Phe Leu Leu Val He Gly Val Val Ala He Tyr He Ala Val 

20 25 30 

Ser His Asp Tyr Pro Asn Asn He Leu Pro He Leu Gly Gin Gin Val 

35 40 45 

Ala Trp He Ala Leu Gly Leu Val He Gly Phe Val Val Met Leu Phe 

50 55 60 

Asn Thr Glu Phe Leu Trp Lys Val Thr Pro Phe Leu Tyr He Phe Arg 
65 70 75 80 

Leu Gly Thr Tyr Asp Leu Ala Asp Cys lie Leu 
85 90 



(2) INFORMATION FOR SEQ ID NO: 3 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 



Met Pro Asp Asp He He Asp Ser Phe Trp Tyr He He Asp His Phe 
15 10 is 

286 



WO 97/37026 PCT/US97/05306 

Leu Lys Asn Val Phe Glu Leu Glu Glu Glu Leu Glu Phe Gin Leu Leu 

20 25 30 

Asn Asn Gin Gly Lys lie Thr Phe His Phe Ser Ser Gin His Leu Pro 

35 40 45 

Thr Ala lie Asp Phe Asp Phe Asn His Pro Phe Asp Pro Arg Tyr Pro 

50 55 60 

Pro Arg Val Leu Val Leu Asp Met Asp Gly Arg Glu Thr He Leu Leu 



80 



65 70 75 

Pro Glu Glu Asn Asp Leu Phe 
85 

(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: . protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 

Met Gly Lys Lys Asp Ala Ser Ala Met His Glu Met Arg Ala Ser Phe 

15 10 15 

He Gin Gly Ser He Glu Ala Gly His Thr Ala Glu Lys Ser Glu Gin 

20 25 30 

Val Phe Asp Val Met Glu Lys Phe Ala Gly Tyr Gly Phe Asn Arg Ser 

35 40 45 

His Ala Tyr Ala Tyr Ser Ala Leu Ala Phe Gin Leu Ala Tyr Phe Lys 

50 . 55 60 

Thr His Tyr Pro Ala He Phe Tyr Gin Val Met Leu Asn Tyr Se- Asn 
65 70 75 80 

Ser Asp Tyr Leu He Asp Ala Leu Glu Ala Gly Phe Glu Val Ala Ser 

85 90 95 

Leu Ser He Asn Thr He Pro Tyr His Asp Lys He Ala Asn Lys Ser 

ioo 105 11Q 

He Tyr He Gly Leu Lys Ser He Lys Gly Leu Gin Gin Gly Leu Gly 

US 120 125 

Ala Leu Asp Tyr 
130 

(2) INFORMATION FOR SEQ ID NO:377: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 

Met Asp Lys Lys Gin Asn Leu Thr Ser Phe Gin Glu Leu Thr Thr Thr 

15 10 is 

Glu Leu Asn Gin lie Thr Gly Gly Glu Trp Trp Glu Glu Leu Leu His 

20 25 30 

Glu Thr lie Leu Ser Lys Phe Lys He Thr Lys Ala Leu Glu Leu Pro 

35 40 45 

He Gin Leu 
50 

(2) INFORMATION FOR SEQ ID NO: 378: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 378: 

Met His Glu Met Phe Thr Ala He Ala Glu Ser Asp Met Lys Lys Ala 

1 5 10 15 

Ala Ala He Gin Arg Lys Phe He Pro Lys Val Asn Ala Leu Phe Ser 

20 25 30 

Tyr Pro Ser Pro Ala Pro Val Lys Ala He Leu Asn Tyr Met Gly Phe 

35 40 45 

Glu Ala Gly Pro Thr Arg Leu Pro Leu Val Pro Ala Pro Glu Glu Asp 

50 55 60 

Val Lys Arg He He Lys Val Val Val Asp Gly Asp Tyr Glu Ala Thr 
65 70 75 80 

Lys Ala Thr Val Thr Gly Val Leu Arg Pro Asp Tyr 
85 90 
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(2) INFORMATION FOR SEQ ID NO: 379: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 



Met Leu Asn Glu Phe Pro lie Phe 

1 5 
Pro Asn Lys Cys Val He Lys Ser 
20 

Thr Lea Gly Asn His Thr Phe Lys 

35 40 
Gin Thr lie Leu Asp Glu Asn Val 

50 55 
Tyr Leu Tyr Thr Tyr Ala Pro Phe 
65 70 



Asp Tyr Glu Asp He Gin Leu He 

10 15 
Arg Ala Glu Ala Asp Thr Ser Val 
25 30 
Leu Pro Val Val Pro Ala Asn Met 
45 

Ala Glu Gin Leu Ala Lys Gly Gly 
60 



(2) INFORMATION FOR SEQ ID NO: 3 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 

Met Asp He Lys Leu Lys Asp Phe Glu Gly Pro Leu Asp Leu Leu Leu 

1 5 10. 15 

His Leu Val Ser Lys Tyr Gin Met Asp lie Tyr Asp Val Pro He Thr 

20 25 30 

Glu Val He Glu Gin Tyr Leu Ala Tyr Val Ser Thr Leu Gin Ala Met 

3 5 40 45 

Arg Leu Glu Val Thr Gly Glu Tyr Met Val Met Ala Ser Gin Leu Met 
50 55 60 
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Leu lie Lys Ser Arg 
65 

Leu Gly Asp Asp Leu 
85 

Arg Lys Phe Lys Leu 
100 

Thr Gly Pro Val Leu 
115 

Cys Gly Ala Cys Ala 
130 



Lys Leu Leu Pro Lys Val 
70 75 
Glu Gin Asp Leu Leu Ser 
90 

Leu Gly Glu His Leu Glu 
105 

Phe Gin Ser Ala Asp Arg 
120 
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Ala Glu Val Thr Asp 
80 

Gin lie Glu Glu Tyr 
95 

Ala Lys His Gin Glu 
110 

Val Asp Leu Arg Arg 
125 



(2) INFORMATION FOR SEQ ID NO: 381: 



( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

<C ) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 

Met Gin Ser Thr Glu Lys Lys Pro Leu Thr Ala Phe Thr Xaa lie Ser 

1 . 5 10 15 

Thr lie lie Leu Leu Leu Leu Thr Val Leu Xaa lie Phe Pro Phe Tyr 

20 25 30 

Trp lie Leu Thr Gly Ala Phe Lys Ser Gin Pro Asp Thr lie Val lie 

35 40 45 

Pro Pro Gin Trp Phe Pro Lys Met Pro Thr Met Glu Asn Phe Gin Gin 

50 55 60 

Leu Met Val Gin Asn Pro Ala Leu Gin Trp Met Trp Asn Ser Val Phe 
65 70 75 80 

lie Ser Leu Val Thr Met Phe Leu Val Cys Ala Thr Ser Ser Leu Ala 

85 90 95 

Gly Tyr Val Leu Ala Xaa Lys Arg Phe Tyr Gly Xaa Arg lie Tyr Cys 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 82: 

Met Leu Phe Met Arg Asp Ser Leu Asp Ser He Val Glu Pro Val Leu 

15 10 15 

Asp Glu Met Gly Arg Phe Tyr Asp Trp Thr Glu Glu Glu Lys Ala Thr 

. 20 25 30 

Tyr Arg Ala Asp Val Lys Ala Ala Leu Ala Gin Asn Asp Leu Ala Glu 

35 40 45 

Leu Lys Asn 
50 

(2) INFORMATION FOR SEQ ID NO: 383: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 

Met Lys He Lys Val Val Thr Val Gly Lys Leu Lys Glu Lys Tyr Leu 

15 10 15 

Lys Asp Gly He Ala Glu Tyr Ser Lys Arg He Ser Arg Phe Ala Lys 

20 25 30 

Phe Glu Met He Glu Leu Ser Asp Glu Lys Thr Pro Asp Lys Ala Ser 

35 40 45 

Glu Ser Glu Asn Gin Lys He Leu Glu He Glu Gly Gin Arg He Leu 

50 55 60 

Ser Lys He Ala Asp Arg Asp Phe Val He Val Leu Ala He Glu Gly 
65 .70 75 ao 

Lys Thr Phe Phe Ser Glu Glu Phe Ser Lys Gin Leu Glu Glu Thr Ser 

85 90 95 

He Lys Gly Xaa Ser Thr Leu Thr Phe He He Gly Gly Ser Leu Gly 

100 105 no 

Leu Ser Ser Ser Val Lys Asn Arg Ala Asn Leu Ser Val Ser Phe Gly 
115 120 125 
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Arg Leu Thr Leu Pro His Gin Leu Met Arg Leu Val Leu Val Glu Gin 

130 135 140 

lie Tyr Arg Ala Phe Thr lie Gin Gin Gly Phe Pro Tyr His Lys 
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 84 amino acids 
(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 

Met Gly lie Leu Ala Trp Cys Asp Leu Val Leu Tyr Arg Leu Trp Asp 

15 10 15 

Ser Phe Met Asp Leu Gly Leu Phe lie Asn Asp Ala Trp Val Arg Lys 

20 25 30 

Lys Lys Thr Leu Asn Lys Glu Arg Lys Lys Ala Gly Lys Ala Ala Leu 

35 40 45 

Pro Glu Asn Arg Trp lie Gin Leu Leu Gly Met Val Val Thr Phe His 

50 55 60 

Val Val Met Leu Ser Phe Leu lie Phe Ser Gly Phe Leu Asn Asn Leu 
65 70 75 80 

Trp Phe Lys Lys 



(2) INFORMATION FOR SEQ ID NO: 3 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 

Met Val Met Ala Glu Asp Gin Ala Val Arg Gin Asn Arg Leu Ala lie 

1 5 10. 15 
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Leu Ser Gin Leu Thr Lys Lys Ala Ala Lys Phe Ala Cys Phe Asn Gin 

20 25 30 

lie Asn Thr Lys 
35 



(2) INFORMATION FOR SEQ ID NO: 3 86: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 86: 

Met Gly Ala Gin Pro Val Gin Asp Thr Glu Thr Ser Ser Ala Leu lie 

1 • 5 10 is 

Ser Ser His Tyr Leu Asp Glu Gin Asp Leu Ser Glu Lys Leu Lys Ser 

20 25 30 

Glu Leu Gin Trp Phe Glu Leu Glu Asn Lys Leu Leu Asn Leu Trp Glu 
35 40 45 

His 



(2) INFORMATION FOR SEQ ID NO: 3 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 

Met Asp Glu Val Thr He Phe Gly lie Asn Tyr Phe Lys Glu His Tyr 

1 5 10 15 

Pro Glu Lys Leu Ala Glu Arg Phe Lys Gin Met Lys He Glu Glu Glu 

20 25 30 

Ala Pro Val He He Met Asp Met Thr Arg Ala Leu Gly Phe Arg Asp 

35 40 45 
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Asp Tyr Asp Arg Phe Tyr Ser Leu Phe Arg Glu Gly Ser Pro 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 3 88: 



(i) SEQUENCE CHARACTERISTICS: 
• (A) LENGTH: 45 amino acids 

<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 



Met Lys Val lie Asn Gin Thr Leu 

1 . 5 
Ser Arg Ser Ser His Lys Gly Asp 
20 

Gly Thr Tyr Pro Tyr Gly Val Xaa 
35 40 



Leu Glu Lys Val lie lie Glu Arg 

10 15 
Tyr Gly Xaa Leu Leu Leu Leu Gly 
25 30 
Ser Ser Trp Leu Leu 
45 



(2) INFORMATION FOR SEQ ID NO: 3 89: 

-<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

Met Asn Trp lie Leu Leu Gin Gly Leu lie Cys Gin Arg Asp Ala Ser 

15 10 15 

Tyr Asp Met Lys Gin Asp Asp Leu Asp Lys Val Ala Asp Tyr Leu Phe 

20 25 30 

Lys Thr Glu Glu Trp Thr Met Tyr Glu Leu lie Leu Phe Gly Asn Leu 

35 40 45 

Tyr Ser Phe Tyr Asp Val Asp Tyr Val Thr Arg lie Gly Arg Glu Val 

50 55 60 

Met Glu Arg Glu Glu Phe Tyr Gin Glu lie Ser Arg His Lys Arg Leu 
65 70 75 80 
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Val Leu He Leu Ala Leu Asn Cys Tyr Gin His Cys Leu Glu His Ser 

85 90 95 

Ser Phe Tyr Asn Ala Asn Tyr Phe Glu Ala Tyr Thr Glu Lys He He 

100 105 no 

Asp Lys Gly He Lys Leu Tyr Glu Arg Asn Val Phe His Tyr Leu Lys 

115 120 125 

Gly Phe Ala Leu Tyr Gin Lys Gly Gin Cys Lys Glu Gly Cys Lys Gin 

130 135 140 

Met Gin Glu Thr Met His He Phe Asp Val Leu Gly Leu Pro Glu Gin 
145 150 155 i60 

Val Ser Leu Leu Ser Gly Thr Leu Arg Lys He Cys Gin Lys Leu lie 

i65 170 175 

Phe Pro Asn Lys Gly Lys Asn Lys Lys Leu Leu Ser Val Leu lie Gin 
180 185 i9o 

(2) INFORMATION FOR SEQ ID NO: 390 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 

Met Ser Thr Val Leu Ser Trp Thr Ala Tyr Lys Thr Gin Arg Leu Glu 

1 5 10 15 

Met Ser lie Leu Leu His Met He Val Asn Gly He Ala Phe Cys Leu 

20 25 30 

Leu Ala Leu Val Val He Met Ser Arg Thr Leu Gly lie Ser Val 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 3 91: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(Xi) SEQUENCE DESCRIPTION: 



SEQ ID NO: 391: 
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Met Arg Glu Asn Asp Leu Leu Leu lie Thr Ala Asp His Gly Asn Asp 

15 10 15 

Pro Thr Tyr Ala Gly Thr Asp His Thr Arg Glu Tyr He Pro Leu Leu 

20 25 30 

Ala Tyr Ser Pro Ala Phe Lys Gly Asn Gly Leu He Pro Val Gly His 

35 40 45 

Phe Ala Asp He Ser Ala Thr Val Ala Asp Asn Phe Gly Val Glu Thr 

50 55 60 

Ala Met He Gly Glu Ser Phe Leu Asp Lys Leu Val 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 3 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 92: 

Met Lys Lys He Phe Leu Thr Leu Leu Thr Val Ser Leu Leu Gly Gly 

15 10 15 

Ala Ser Thr Ala Val Ala Gin Asp Phe Thr He Ala Ala Lys His Ala 

20 25 30 

He Ala Val Glu Ala Asn Thr Gly Lys He Leu Tyr Glu Lys Asp Ala 

35 40 45 

Thr Pro Pro Val Glu He Ala Ser He Thr Lys Leu He Thr Val Tyr 

50 55 60 

Leu Val Tyr Glu Ala Leu Glu Asn Gly Ser He Thr Leu Ser Thr Pro 
65 70 75 80 

Val Asp He Ser Asp Tyr Pro Tyr Gin Leu Thr Thr Asn Ser Glu Ala 

85 90 95 

Ser Asn He Pro Met Glu Ala Arg Asn Tyr Thr Val Glu Glu Leu Leu 

100 105 110 

Glu Ala Thr Leu Val Ser Ser Ala Asn Ser Ala Ala He Ala Leu Ala 

115 120 125 

Glu Lys He Ala Gly Ser Glu Lys Asp Phe Val Xaa Met Met Arg Ala 
130 135 140 
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Lys Leu Leu Glu Trp Gly lie Arg He His Cys Cys Gin Tyr Asp Arg 

145 150 . 155 160 

Ser 



(2) INFORMATION FOR SEQ ID NO: 393: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 88 amino acids 

(B) . TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 93: 

Met Ser Asn Ala Met: Glu Gly Val Leu Tyr Phe Leu Lys Pro Asp Phe 

1 5 io is 

Ser Lys Leu Thr Ser Ala Asp Leu Leu Tyr Ala Leu Gly Gin Ser Phe 

20 25 30 

Phe Ala Leu Ser Leu Gly Val Thr Asp Met Leu Thr Tyr Ala Ser Tyr 

35 40 45 

Leu Asp Lys Lys Thr Asn Leu Val Gin Ser Gly He Ser He Val Thr 

50 55 60 

Met Asn He Ser He Val His His Gly Arg Ser Ser His Phe Pro Ser 
65 70 75 so 

His Val Ser Leu Gin Tyr Pro Leu 
85 



(2) INFORMATION FOR SEQ ID NO: 3 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 94: 



Met Lys Val Ala Lys Pro Phe Trp Ala He Val Lys Leu Lys Ser Val 
1.5 10 15 
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Gin His Ala Ser Met 
20 

Lys Gin Val Glu Asn 
35 

Lys Glu Asn Lys Ala 
50 

Gin Thr Phe Leu Thr 
65 



Leu Glu Asn Pro Lys Glu 
25 

Leu Ala Leu Glu Asn Gin 
40 

Phe Glu Gin lie Lys Asp 
55 

lie Phe Leu Tyr Gly Cys 
70 75 
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Met Asp Gly Leu Met 
30 

Gly Tyr Gin Val Glu 
45 

Ser Val Ala Thr Phe 
60 



(2) INFORMATION FOR SEQ ID NO: 395: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395: 

Met Asp Ala Glu Val Ser Lys Asn Leu Arg Leu lie Leu Glu Arg Lys 

1 5 . .10 15 

Gly Met Thr He Leu Thr Gly Thr Lys Leu Gin Glu He He Glu Glu 

20. 25 30 

Asn Gly Gin Leu Arg lie Lys Val Glu Gly Lys Asp Asn lie He Ala 

35 40 45 

Ser Lys Ala Leu Leu Ser He Gly Arg Met Pro Asp Leu Glu Gly He 

50 55 60 

Gly Glu Val Glu Phe Glu Leu Asp Arg Gly Cys He Lys Val Asn Glu 
65 70 75 80 

Tyr Met Glu Thr Ser Val Pro Arg He Tyr Ala Thr Arg 
85 90 



(2) INFORMATION FOR SEQ ID NO:396: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 



Met Lys Asn Ser lie Met Asp Thr 

1 5 
Leu Asn Lys lie lie lie Val Phe 
20 

Leu Leu Tyr He Val Val Ala Ser 

35 40 
Ser Arg Gly He Ser Phe Asn Pro 

50 55 
Gin Arg Val He Gin 
65 



Lys Phe Asp Arg Arg He Leu Leu 

10 15 
He Val Leu Met Thr Leu Leu Pro 
25 30 
Phe Met Asp Pro Lys Val Leu Val 
45 

Ala Asp Trp Thr Val Glu Gly Tyr 
60 



(2) INFORMATION FOR SEQ ID NO: 3 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397: 

Met Val lie Pro Glu Ala Gly Tyr Leu Tyr Ala Phe His Tyr Pro Asn 

1 . 5 10 15 

Leu Lys Gly Lys Gly Lys Glu Ala Val Gin Val He Tyr Asn Leu Asn 

20 25 30 

Leu Ala Ser Ala Lys Val He Gin Leu Tyr Arg Ser Leu Gly Leu Asp 

35 .40 45 

Gly Lys He Gly He He Leu Asn Leu Thr Pro Ala Tyr Pro Arg Ser 

50 55 60 

Asn Ser Pro Glu Asp Leu Glu Ala Ser Arg Phe Thr Asp Asp Phe Phe 

65 70 75 80 

Asn Lys Val Phe Leu Glu Ser Ser Cys 

(2) INFORMATION FOR SEQ ID NO: 3 98: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 86 amino acids 
<B) TYPE: amino acid 
(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:398: 



Met Thr Arg lie Ala Ser Val Ser Val lie Arg Asn Ala Ala Phe lie 

15 10 15 

Ala lie Ala Leu Ser Phe Leu Gly Lys Phe Thr Ala Leu lie Ser Thr 

20 25 30 

lie Pro Asn Ala Val Leu Gly Gly Met Ser lie Leu Leu Tyr Gly Val 

35 40 45 

lie Ala Ser Asn Gly Leu Lys Val Leu lie Lys Glu Arg Val Asp Phe 

50 55 60 

Ala Gin Met Arg Asn Leu lie lie Ala Ser Ala Met Leu Val Leu Gly 
65 70 .75 80 

Leu Gly Arg Ser Tyr Pro 
85 

(2) INFORMATION FOR SEQ ID NO: 3 99: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : singie 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: protein 



(xi) SEQU] 

Met Ser Leu Arg 
1 

Asn Trp Thr Arg 
20 

Arg Thr Asp Phe 
35 

Val Leu Glu Glu 
50 

Ser Val Thr Glu 
65 

Phe Val Gin Ala 



INCE DESCRIPTION: 

Glu Lys Ser Met 
5 

Val Ala Val Phe 

Asp Ala Phe Val 
40 

Asp Leu Pro Leu 
55 

Glu Glu Arg Gin 
70 

Arg Pro Glu Lys 
85 



SEQ ID NO:399: 

Ser Glu Tyr Lys 
10 

Ala Gly Gly Asn 
25 

Gly Val Asp Arg 

Ala Leu Ala Val 
60 

Val lie Gin Lys 
75 

Asp Asp Thr Asp 
90 



Leu Ser Glu Asn 
15 

Arg Gly His Tyr 
30 

Gly Ser Leu Trp 
45 

Gly Asp Phe Asp 

Arg Ala Gin Tyr 
80 

Leu Glu Leu Ala 
95 
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Leu Leu Thr lie Phe Glu Gin Asn 
100 

Gly Ala Leu Gly Gly Arg lie Asp 
115 120 

Thr 
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Pro Gin Ala Glu Val Thr lie Phe 
105 no 
His Met Leu Ala Asn Val Leu Ser 
125 



(2) INFORMATION FOR SEQ ID NO: 4 00: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 

Met Val Leu Gin Arg Asn Glu He Asn Glu Lys Asp Thr Trp Asp Leu 

1 5 io 15 

Ser Thr He Tyr Pro Thr Asp Gin Ala Trp Glu Glu Ala Leu Lys Asp 

20 25 30 

Leu Thr Glu Gin Leu Glu Thr Val Ala Gin Tyr Glu Gly His Leu Leu 

35 40 45 

Asp Ser Ala Asp Asn Leu Leu Val Asn His 
50 55 

(2) INFORMATION FOR SEQ ID NO: 4 01: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear' 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401: 

Met lie Tyr Gly Ala He He Asp Ala Lys Thr Ala Glu Asn Ala Ala 

1 5 10 15 

Gly Met Thr Ala Met Gin Thr Ala Thr Asp Asn Ala Lys Lys Val He 
20 25 30 
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Asn Asp Leu Thr lie Gin Tyr Asn Arg Ala Gin Thr Gly Gly Tyr Tyr 

35 40 45 

Thr Arg Asn Tyr Arg Asn Arg Ser Ser Ala 
50 55 



(2) INFORMATION FOR SEQ ID NO: 4 02: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 02: 



Met Thr Asp Gly Tyr Glu His Tyr 

1 5 
Lys Asn Leu lie Phe Pro Gly lie 
20 

lie Val Cys Ser Asn Gly Ser Phe 
35 40 



Lys Lys lie Met Glu Asn Pro Pro 

10 15 
Val Ser Pro Glu Arg Met Arg Glu 
25 30 
Leu Val Ala 



(2) INFORMATION FOR SEQ ID NO: 4 03: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:403: 



Met Ser Lys Gin 
1 

lie Val Gin Ser 
20 

Thr Ala Phe Gin 
35 

Leu Asp Lys Ala 
50 



Lys Lys Phe Glu 
5 

Leu Glu Asn Gly 

Lys Gly Met Val 
40 

Glu Lys Thr Leu 
55 



Glu Asn Leu Ala 
10 

Glu lie Ala Leu 
25 

Leu Ser Lys Glu 

Val Lys Val Met 
60 



Glu Leu Glu Thr 
15 

Glu Asp Ala lie 
30 

Leu Gin Ala Thr 
45 

Gin Glu Asp Gly 
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Thr Glu Ser Asp Phe Glu 
65 70 
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(2) INFORMATION FOR SEQ ID NO: 404: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 

Met Phe Ala Ala Val Thr Ala Gly Tyr Gin Ala Ala Leu Met Val Pro 

1 5 10 15 

Thr Glu lie Leu Ala Glu Gin His Phe Glu Ser Leu Gin Asn Leu Phe 

20 25 30 

Pro Asn Leu Lys Leu Ala Leu Leu Thr Gly Ser Leu Lys Ala Ala Glu 

35 40 45 

Lys Arg Glu Val Leu Glu Thr He Ala Lys Gly Glu Ala Asp Leu He 

50 55 60 

He Gly Thr His Ala Leu He Gin Asp Gly Val Glu Tyr Ala Arg Leu 
65 70 75 80 

Gly Leu He He He Asp Glu Gin His Arg Phe Gly Val Gly Gin Arg 

85 90 95 

Arg He Leu Arg Glu Lys Gly Asp Asn Pro Asp Val Leu Met Met Thr 

100 105 110 

Ala Thr Pro He Pro Arg Thr Leu Ala He Thr Ala Phe Gly Asp Met 

I 15 120 12 5 

Asp Val Ser He He Asp Gin Met Pro Ala Gly Arg Lys Pro Tyr Cys 

130 135 140 

Asp Ala Leu Asp Gin Thr 
I" 150 



(2) INFORMATION FOR SEQ ID NO: 40 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405: 

Met Asn Arg Glu Ala Leu Arg Leu Tyr Leu Val Thr Asn Arg Tyr Gin 

15 10 15 

Asp Ser Val Glu Ser Phe Leu Ala Lys Val Glu Thr Ala Cys Arg Ser 

20 25 30 

Gly Val Thr He Val Gin Leu Arg Glu Lys Asn Leu Thr Thr Asn Gin 

35 - 40 45 

Tyr Tyr Gin Leu Ala Lys Gin Val Lys Glu He Thr Asp Ala Tyr Gin 

50 55 60 

Val Pro Leu He He Asp Asp Arg Leu Asp Val Cys Leu Ala Val Asp 
65 70 75 80 

Ala Ala Gly Leu His He Gly Asp Asp Glu Leu Pro Val Ser Val Ala 

85 90 95 

Arg Gin Val Leu Gly Pro Glu Lys He Pro Arg Cys His Arg 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 406: 

(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 

Met Gin Gly Ser His Phe His Leu Pro He Tyr Arg Met Thr Ser Gin 

1 5 10 15 

Ala Leu Leu Asp Glu Thr Lys Lys Val Ala He Pro Val Leu Ala Thr 

20 25 30 

Thr Gin Ser Lys Asp Ser Val Asp Tyr Arg Glu Leu Pro Ser He Glu 

35 40 45 

Asn Phe Val Leu Val Met Gly Asn Glu Gly Gin Gly He Ser Pro Leu 

50 55 60 

Met Ala Glu Ser Ala Asp Gin Leu Val His He Ser Met. Lys Gly Gin 
65 70 75 80 

Ala Glu Ser Leu Asn Val Ala Val Ala Ala Gly He Leu He Phe His 
85 90 95 

Leu Ser 
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(2) INFORMATION FOR SEQ ID NO: 407; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 



Met Thr Lys Thr Ala Phe Leu Phe Ala Gly Gin Gly Ala Gin Tyr Leu 

15 10 15 

Arg Met Gly Arg Asp Phe Tyr Asp Gin Tyr Pro lie Val Lys Glu Thr 

20 25 30 

lie Asp Arg Ala Ser Gin Val Leu Gly Tyr Asp Leu Arg Tyr Leu He 

35 40 45 

Asp Thr Glu Glu Asp Lys Leu Asn Gin Thr Arg Tyr Thr Gin Pro Ala 

50 55 60 

He Leu Ala Thr Ser Val Ala He Tyr Arg Leu Leu Gin Glu Lys Gly 
65 -70 75 80 

Tyr His Pro Asp Met Val Ala Gly Leu Ser Leu Gly Glu Tyr Ser Ala 

85 90 95 

Leu Val Ala Ser Gly Ala Leu Asp Phe Glu Asp Ala Val Ala Leu Val 

100 105 no. 

Ala Lys Arg Gly Ala Tyr Met Glu Glu Ala Ala Pro Ala Asp Ser Gly 

^ 120 125 

Lys Met Val Ala Val Leu His Thr Pro 
130 135 

(2) INFORMATION FOR SEQ ID NO: 40 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408: 
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Met Tyr Gin Phe Ser Gly Gin Thr Lys Val Xaa Glu Val Leu Ala Phe 

1 5 10 is 

Arg Asp Lys Pro Pro Tyr Gly Gly Ser Ser Ala Met Pro Leu Arg Cys 
2 0 2 5 30 

Leu 



(2) INFORMATION FOR SEQ ID NO: 4 09: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409: 

Met Thr Lys Glu Thr Gly Met Glu Gin Thr Phe Phe He He Lys Pro 

15 10 is 

Asp Gly Val Lys Arg Gly Leu Val Gly Glu Val Leu Lys Arg He Glu 

20 25 30 

Gin Arg Gly Phe . Thr He Glu Lys Leu Glu Phe Arg Ser Gin Val Phe 

35 40 45 

Arg Arg Val Asp 
50 

(2) INFORMATION FOR SEQ ID NO: 410: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410: 

Met Thr Ser Gly Pro Val Leu Val Gly Val He Ser Gly Pro Lys Val 
1 5 io 15 



306 



80 



WO 97/37026 PCT/US97/05306 

He Glu Thr Trp Arg Thr Met Met Gly Ala Thr Arg Pro Glu GLu Ala 

20 .25 30 

Leu Pro Gly Thr He Arg Gly Asp Phe Ala Lys Ala Ala Gly Glu Asn 

35 40 45 

Glu He lie Gin Asn Val Val His Gly Ser Asp Ser Glu Lys Asn Gin 

50 55 60 

Leu Ser Arg Glu He Ala Pro Leu Val Leu Arg Val Asp Trp Leu Asn 
65 70 75 

Gin Leu Asp Lys Ser Ser Phe Glu 
85 

(2) INFORMATION FOR SEQ ID NO: 411 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s i ng 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:411: 

Met Gly Leu Lys Asp Asn Ala Gly Leu Val Lys Glu Tyr Ala Leu Val 

1 5 10 15 

Asp Ala Val Glu Tyr Gin Asn Val He Val Ala Thr Thr Val Glu Glu 

20 25 30 

Met Leu Ser Lys Tyr Ala Asn Lys Asn Asp Leu Glu He Asp Asn Ala 

35 40 45 

Thr Thr Glu Ser He Lys Gly Val Val Ala Asp Leu Lys Ser Ala Val 

50 • 55 60 

He Lys Gly Asp Thr Val Tyr Phe Phe Lys Val Asp Gly Lys lie Tyr 
65 70 75 so 

Lys val Lys Ala Ser Val Ser Asp Asp Leu Pro Tyr Leu Glu Asn Gly 

85 90 95 

Lys Thr Phe Glu Gly Gin Val Xaa Lys Asp Asn Tyr Leu Lys Thr Phe 
100 105 110 

Lys Val Gin 
115 

(2) INFORMATION FOR SEQ ID NO: 412: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 50 amino acids 
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(B) TYPE: amino acid 

(C) STRAND EDNESS : single 
.(D) TOPOLOGY: linear 



PCT/US97/05306 



(ii) MOLECULE TYPE: protein 

.(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: 

Met lie Ser Gly Ser Glu lie Arg Asp lie Val Thr Ser Asp lie Pro 

1-5 10 15 

Leu Ala Asp Lys Thr Glu Thr Leu Val Arg Phe Ala Asn Asn Ala Gly 

20 25 30 

Gly Leu Asp Asn lie Thr Val Ala Leu Val Ser Met Asn Glu Glu Asp 
35 40 45 

Glu Glu 
50 

(2) INFORMATION FOR SEQ ID NO: 413: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:413: 

Met Pro Asp Asn Leu Ala Leu Arg Met Arg Pro Lys Thr He Asp Gin 

1 5 10 15 

Val He Gly Gin Glu Xaa Leu Val Gly Pro Gly Lys He He Arg Arg 

20 25 30 

Met Val Glu Ala Asn Arg Leu Ser Ser Met He Leu Tyr Gly Pro Pro 

35 40 45 

Gly He Gly Lys Thr Ser He Ala Ser Ala He Ala Gly Thr Thr Lys 

50 55 60 

Tyr Ala Phe Arg Thr Phe Asn Ala Thr Val Asp Ser 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 414: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 3 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414: 



Met Pro Glu Glu Leu Ala Tyr Leu Val Gin His Leu Asp Asn Ala Gin 

Glu Gin Val Val Leu Gly Asn Thr Tyr His Thr Gly Asn His Cys Phe 
20 25 30 

Ser 



(2) INFORMATION FOR SEQ ID NO: 415: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415: 

Met Val Phe Trp Glu Gly Leu Val Pro Thr Pro Asp Val He Asp Ala 
1 5 10 



15 

r 

30 



Gly His Val Thr Gly Phe Leu Tyr Thr Val His Ser Ser Ser Thr Glv 

20 25 

Leu He Val Leu Gin He Lys Lys Asp Leu Leu Lys Leu 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416: 



Met Thr Asp Arg 
1 

Cys Val Ser Gin 
20 

Asp Trp Gly Val 

35 

Asp Glu Asp Leu 
50 

Lys Leu Gin Val 
65 

He Ser Gin Phe 

Gin Val Gly Ser 
100 

Lys Ser Gly He 
115 

Asn Asn He He 
130 



Tyr Asp Ser. Ser 
5 

Gin Gin Thr Val 

Lys Tyr Pro Leu 
40 

Asp Ser Trp Arg 
55 

Phe Gly Asp Asp 
70 

Lys Asp Cys Ala 
85 

Val Ser Lys Thr 

Ser Met: Ala Phe 
120 

Ser His Leu Ala 
135 



Leu Gly He Tyr 
10 

Thr Glu He Met 
25 

Val Tyr Leu Glu 

Lys Phe Gin Leu 
60 

Phe Tyr Ala Thr 
75 

Asp Gly lie Val 
90 

Leu Glu Val Met 
105 

Ser Gin Arg Thr 

Met Ser Val He 
140 



Lys Val Pro Trp 
15 

Asp Thr Tyr Cys 
30 

Asp Pro Phe Ser 

45 . 

He Lys Pro Leu 

Asn Leu Glu Arg 
80 

He Lys Pro Asn 
95 

Glu Tyr Ala Glu 
110 

Ala Glu Thr Glu 
125 

Leu Leu He 



(2) INFORMATION FOR SEQ ID NO: 417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) , LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 

Met Lys Lys He Ser Lys Gin Glu Leu Val Asn Thr Arg Phe Ser Arg 

1 5 10 15 

Leu Phe Ala His Phe Gly Gin Glu Lys Asp Gly Ser Phe Leu Ala Gin 

20 25 30 

Arg Tyr Gin Phe Tyr Leu Ala Gin Gin Gly Gin Thr Leu Ser Gly Ala 

35 40 45 

His Asp Leu Leu Asp Ser Leu He Glu Arg Asp Tyr Asn Leu Tyr Ala 

50 55 60 

Ala Thr Asn Gly He Thr Ala He Gin Thr Gly Arg Leu Ala Gin Ser 
65 70 75 80 
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Gly Leu Ala Pro Tyr Phe Asn Gin Val Phe He Ser Glu Gin Leu Gin 

85 90 95 

Thr Gin Lys Pro Asp Ala Leu Phe Tyr Glu Lys He Gly Gin Gin He 

100 105 no 

Ala Gly Phe Ser Lys Glu Lys Thr 
115 120 

(2) INFORMATION FOR SEQ ID NO: 418: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 

Met: His Arg Lys Thr Val He Asp Xaa Arg Thr Leu Gly Glu Arg Tyr 

1 5 10 15 

Thr Phe Thr Xaa Pro He Lys Glu Leu Lys Thr Arg Asn Val Ala Glu 

20 25 30 

Val Ala Asp Leu Leu Ala Xaa Val Glu Ser Tyr Gin Glu Gin Asp Tyr 

35 40 45 

Tyr Val Val Gly Tyr Val Ser Tyr Glu Ala Ala Pro Ala Phe Glu Glu 

50 55 60 

Lys Leu Ala Val His Lys Val Pro Leu Leu Gly Glu Tyr Leu Leu Tyr 
65 7 ° 75 ao 

Phe Thr Val His Asp Arg Val Glu Thr Ser Pro He Pro Leu Thr Tyr 

85 90 95 

Glu Asp He Asp Leu Pro Ser Asn Trp Gin Gly Xaa Asn Val Cys Thr 
100 105 no 

Glu Leu 



(2) INFORMATION FOR SEQ ID NO: 419: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single . 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419: 

Met Glu Arg He Glu Val Arg Ser Arg Ser Gly Ser Ala His Leu Gly 

15 10 15 

H:ls Val Phe Thr Asp Gly Pro Arg Glu Leu Gly Gly Leu Ara Tyr Cys 

20 25 30 

He Asn Ser Ala Ser Leu Arg Phe Val Ala Lys Asp Glu Met Glu Lys 

35 40 45 

Ala Gly Tyr Gly Tyr Leu Leu Pro Tyr Leu Asn Lys 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 420: 

<i> SEQUENCE CHARACTERISTICS :. 

(A) LENGTH: 82 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 42 0: 

Met Glu Trp He Arg Leu He Gly He Ala He He Val Val Gly Phe 

15 10 15 

He Leu Lys Phe Asp Thr He Ala Thr Val Val Leu Ala Gly Leu Val 

20 25 30 

Thr Ala Leu Val Ser Gly Val Ser Leu Val Glu Phe Leu Glu He Leu 

35 40 45 

Gly Lys Glu Phe Ser Asn Gin Arg Val Leu Thr lie Phe Met Val Thr 

50 55 60 

Leu Pro Leu Val Gly Leu Ser Glu Thr Phe Gly. Leu Asn Asn Asp Gin 

65 ™ 75 " 80 

Ser He 



(2) INFORMATION FOR SEQ ID NO: 421: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 5 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : singl e 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 

Met Leu Leu He Phe Ala Val He Gly Val Leu Val Ala Leu He Ala 

15 10 15 

Gin Phe Tyr Ser Ala Lys Ala Ala Val Gly Phe Ala Lys Glu Leu Thr 

20 25 30 

Asn Asp Leu Tyr Arg His He Leu Ser Leu Pro Lys Asp Ser Arg Asp 

35 40 45 

Arg Leu Thr Thr Ser Ser Leu Val Thr Arg Leu Thr Ser Asp Thr Tyr 

50 55 60 

Gin He Gin Thr Gly He Asn Gin Phe Leu Arg Leu Phe Leu Arg Ala 
65 70 75 80 

Pro He He Val Phe Gly Ala He Phe Met Ala Tyr Arg lie Ser Ala 

85 90 95 

Glu Leu Thr Phe Trp Phe Leu Val Met Val Ala He Leu Thr He Val 

100 105 HO 

He Val Arg Val He Ser lie Gly Gin Ser Ser Leu Gin 
115 120 125 

(2 ) . INFORMATION FOR SEQ ID NO: 422: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 

Met Pro He Glu Gin Val Gin Lys Leu Leu Gly His Ser Lys He Asp 

1 5 10 15 

Thr Thr Leu Ala Tyr Ala Met Val Asn Gin Asn Asn Val Lys His Ser 

20 25 30 

His Gin Lys Phe He Ser 
35 

(2) INFORMATION FOR SEQ ID NO: 423: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423: 

Met Lys Pro Cys Trp Lys Lys Cys Phe Pro Glu Glu Val Ser Ser Glu 

1 5 10 15 

Gly Glu Val Thr Leu He Glu He Pro Val Ser Asp Lys He Ala Gly 

20 25 30 

Lys Gin Val His Glu Leu Asn Leu Pro His Asn Val Leu He Thr Thr 

35 40 45 

Gin Val His Asn Gly Lys Ser Gin Thr Val Asn Gly Ser Thr Arg Met 

50 55 60 

Tyr Leu Gly Asp Met He His Leu Val He Pro Lys Ser Glu He Gly 
65 70 75 80 

Lys Val Lys Asp Leu Leu Leu 
85 

(2) INFORMATION FOR SEQ ID NO: 4 24: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 



Met Asn Glu Gly Val Glu Asn Phe 

1 5 
Ala Lys Asn He Gly He Tyr Val 
20 

Ser He Asp Thr Gly Lys Phe Thr 

35 40 
Ser Asp Ser Gly Phe Leu Glu Ser 
50 55 



Arg Ala Lys Leu Ala Ser Leu Gly 

10 15 
Gly Val Tyr Phe Met Glu Glu His 
25 30 
Ser Val Trp He Pro Ser Tyr Gly 
45 

Ser Pro Lys Thr Asp Leu Asp Tyr 
60 
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Asp lie His Gin Tyr Thr Ser Lys Gly Lys Leu Pro Ala Leu Thr Thr 
65 70 75 80 

lie 



(2) INFORMATION FOR SEQ ID NO:425: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425: 

Met Val Ser Lys Tyr Leu Leu Leu Ala Val lie Phe Ser Gly Leu Val 

1 - 5 10 is- 

Thr Trp He Leu Arg Met lie Ser Phe He Leu Val Lys Tyr Lys Gly 

20 25 30 

Leu Pro Ala He Val Glu Arg Phe Leu Lys Phe Leu Pro Val Ser He 

35 40 45 

He Phe Ala Leu He Leu Ser Ser Val Val Thr Gly Lys Val Gly Ser 

50 55 60 

Leu Pro Gin lie Lys Trp Leu Asp Phe Leu Ala Val Phe Pro Thr Ala 
65 70 75 80 

Trp Val Ala Phe Arg Tyr Arg Asn Leu Leu Gly Thr Val Leu Phe Gly 

85 90 95 

Val Val Leu He Ala He Leu Arg Leu Val Ser 
100 105 

(2) INFORMATION FOR SEQ ID NO: 42 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426: 
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Met Glu Asp Xaa Glu Thr Gin Arg Val Val Met Gin Tyr Arg Ala Pro 

15 10 15 

Glu Asn Asn Arg Trp Ser Gly Tyr Ala Phe Pro Gly Gly His Val Glu 

20 25 30 

Asn Asp Glu Ala Phe Ala Glu Ser Val He Arg Glu He Tyr Glu Glu 

35 40 45 

Thr Gly Leu Thr He Gin Asn Pro Gin Leu Val Gly He Lys Asn Trp 

50 55 60 

Pro Leu Asp Thr Gly Gly Arg Tyr He Val He Cys Tyr Lys Ala Thr 
65 7 0 75 80 

Glu Phe Ser Gly Thr Leu Gin Ser Ser Glu Glu Gly Glu Val Ser Trp 

85 90 95 

Val Gin Lys Asp Gin He Pro Asn Leu Asn Leu Ala Tyr Asp Met Leu 

100 105 no 

Pro Leu Met Glu Met Met Glu Ala Pro Asp Lys Ser Glu Phe Phe Tyr 

115 120 125 

Pro Arg Arg Thr Glu Asp Asp Trp Glu Lys Lys He Phe 
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 42 7: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427: 

Met Thr He Glu Leu Leu Thr Pro Phe Thr Lys Val Glu Leu Glu Pro 

1 5 10 15 

Glu He Lys Glu Lys Lys Arg Lys Gin Val Gly He Leu Gly Gly Asn 

20 25 30 

Phe Asn Pro Val His Asn Ala His Leu He Val Ala Asp Gin Val Arg 

35 40 45 

Gin Gin Leu Gly Leu Asp Gin Val Leu Leu Met Pro Glu Tyr Gin Pro 

50 55 60 

Pro His Val 
65 

(2) INFORMATION FOR SEQ ID NO: 428: 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 96 amino acids 
<B) TYPE: amino acid 
(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 42 8: 

Met Lys Lys Leu Val Phe Val Cys Leu Gly Asn He Cys Arg Ser Pro 

15 10 15 

Met Ala Glu Phe Val Met Lys Ser Met Thr Asp Asn Tyr Glu He Gin 

20 25 30 

Ser Arg Ala Thr Ser Ser Trp Glu His Gly Asn Pro He His Lys Gly 

35 40 45 

Thr Gin Gly He Phe Gin Glu Tyr Glu He Pro Tyr Asp Lys Asn Lys 

50 55 60 

Thr Ser Leu Gin He Ser Lys Glu Asp Phe Glu Ala Phe Asp Tyr He 
65 70 75 80 

He Gly Met Asp Ala Ser Asn Val Pro Thr Tyr Val Arg Cys Val Gin 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 429: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 9: 

Met Val Lys Tyr Gly Val Val Gly Ala Gly Tyr Phe Gly Ala Glu Leu 

1 5 ig 15 

Ala Arg Tyr Met Gin Lys Asn Asp Gly Ala Xaa He Thr Leu Leu Tyr 

20 25 30 

Asp Pro Asp Asn Ala Glu Ala He Ala Glu Glu Leu Gly Ala Lys Val 

35 40 45 

Ser Lys Phe Leu Arg 
50 
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(2 ) INFORMATION FOR SEQ ID NO : 430 : 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 74 amino acids 
(B> TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430: 

Met Leu Trp Lys Tyr Tyr Asn Ser Ser Phe Leu Lys Ala Val Pro His 

15 10 15 

Leu Thr Thr Glu Tyr Lys Arg Leu Ala Gin Ala His Gly Leu Asn Leu 

20 25 30 

Lys Gin Ala Lys Pro He Thr Met Gly Met Trp He Gly Gly Asp Arg 

35 40 45 

Glu Gly Asn Pro Phe Val Thr Ala Lys Asn Leu Lys Gin Ser Ala Leu 

50 55 60 

Thr Gin Cys Glu Val Xaa Leu Asn Leu Leu 
65 70 

(2) INFORMATION FOR SEQ ID NO: 4 31: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 6 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE. TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431: 

Met Asn Thr Leu Tyr Leu Asn Gin His Ser Ser Tyr Lys Asn Asp Glu 

1 5 10 is 

Asp Ser His Ser Phe Pro lie Gin Met Glu Leu Val Ser Asp Glu Met 

20 25 30 

He Pro Arg Asn 
35 

(2) INFORMATION FOR SEQ ID NO: 432: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 32: 

Met Ala Arg Lys Glu Thr Met Ala Glu Arg Lys Tyr Cys Lys Met Lys 

1 

Lys Asp lie His Pro Glu Tyr Arg Pro Val Val Phe Met Asp Thr Thr 

20 25 30 

Thr Gly Tyr Gin Phe Leu Ser Gly Ser Thr Lys Arg Ser Asn Glu Thr 



35 40 



Val Glu Phe Glu Gly Glu Thr Tyr Pro Leu He Arg Val Glu He Se- 

50 55 eo 

Ser Asp Ser His Pro Phe Tyr Thr Gly Arg Gin Lys Phe Thr Gin Ala 



80 



65 ™ 75 

Asp Gly Arg Val Asp Arg Phe Asn Lys Lys Tyr Gly Leu Lys 
85 90 

(2) INFORMATION FOR SEQ ID NO: 433 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s i ng 1 e 

( D ) TOPOLOGY .- 1 inear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:433: 
Met Glu Thr Ala Leu He Ser Val He Val Pro Val Tyr Asn Val Ala 

1 5 " 15 

Gin Tyr Leu Glu Lys Ser He Ala Ser He Gin Lys Gin Thr Tyr Gin 

20 25 30 

Asn Leu Glu lie He Leu Val Asp Asp Gly Ala Thr Asp Glu Ser Gly 

35 4 0 45 

Arg Leu Cys Asp Ser lie Ala Glu Gin Asp Asp Arg Val Ser Val Leu 
50 55 



319 



WO 97/37026 PCT/US97/05306 

His Lys Lys Asn Glu Gly Leu Ser Gin Ala Arg Asn Asp Gly Met Lys 
65 70 75 80 

Pro Gly Ser Arg Gly Leu Ser Asp Phe Tyr 
85 90 



(2) INFORMATION FOR SEQ ID NO: 4 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434; 



Met Tyr Phe Leu lie Asn Phe Val Tyr Pro Val Asp Met Val lie Asn 

1 5 10 15 

Leu Pro Phe Leu lie Asn Thr Gly Leu lie Val Leu Leu Ser Ala He 

20 25 30 

Ser Tyr He Ser Leu Leu Val Phe Thr Lys Asp Ser He Phe Tyr Glu 

35 40 45 

Phe Leu Asn His Val Leu Ala Leu Lys Asn Lys Phe Lys Lys Ser 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 43 5: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435: 

Met Arg Val He Ala Trp Leu Glu Pro Ser Arg Lys He Leu Gin Pro 

1 5 10 15 

Gly Ala Asn Asp Val Trp Val Val Lys Arg Lys Gly Lys Arg Asp Leu 

20 25 30 

Leu Leu Pro Tyr He Pro Pro Val Val Leu Asn Val Asp He Pro Asn 
35 40 . 45 
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Lys Arg Val Asp Val Glu He Leu Glu Gly Leu Asp Asp Glu Asp 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 43 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 6: 

Met Asp Asp Asp Phe Lys Arg Tyr Asn Ala Pro He Leu Thr Trp Tyr 

1 5 10 15 

Glu Thr Ala Arg Tyr Ala Phe Glu Arg Gly Met Val Trp Gin Asn Leu 

20 25 30 

Gly Gly Val Glu Asn Ser Leu Asn Gly Gly Leu Tyr His Phe Lys Glu 

35 40 45 

Lys Phe Asn Pro Thr He Glu Glu Tyr Leu Gly Glu Phe Thr Met Pro 

50 55 60 

Thr His Pro Leu Tyr Pro Leu Leu Arg Leu Ala Leu Asp Phe Arg Lys 

65 70 75 3o 

Thr Leu Arg Lys Lys His Arg Lys 

85 

(2) INFORMATION FOR SEQ ID NO: 43 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:437: 
Met Glu Asp Phe Phe Ala Trp Cys Arg Arg Gin Ser Val Leu Ser Gly 
Ser Lys Leu Gly Arg Ala He Glu Tyr Ser Leu Lys Tyr Glu Glu Thr 
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Phe Lys Thr lie Leu Lys Asp Gly His Leu Val Leu Ser Asn Asn Leu 

3 5 40 45 

Ala Glu Arg Ala He Lys Ser Leu Val Met Gly Arg Ser Lys Arg Val 

50 55 60 

Gin Trp Thr Leu Leu Ala 
65 70 

(2) INFORMATION FOR SEQ ID NO: 43 8: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 8: 

Met Thr Lys Gin Gly Ser Trp Lys Leu He Leu Glu Asp Glu Ser Ala 

1 5 10 15 

Gly Arg Glu Leu Phe Pro lie Leu Thr Gin Gly Xaa Tyr He Ala Thr 

20 25 30 

Phe Asp Gin Gin Ala Pro Xaa He Asp Glu He Phe Lys Leu 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 43 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 9: 

Met Thr Thr Asn Xaa Asp Asn Asp Thr Ser Glu Arg Ser Asp Cys Tyr 

15 10 15 

Arg Lys Arg Thr Ala Gly Glu Thr Pro Met Asn Asp Leu Thr Ser His 

20 25 30 

Thr His Gly Gly Asn Tyr Thr He Ala Arg Tyr Gin Glu Glu Lys Phe 
35 40 45 
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Trp Asn Lys Gin Leu 
50 
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(2) INFORMATION FOR SEQ ID NO: 440: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 40: 

Met Ala Gin His Ala Ala Val Glu Ala Leu Thr Ala Gly Lys Asn Asp 

Ala Glu Pro Met Lys Lys Glu Tyr He Gin Arg Arg Asp Tyr He He 

20 25 30 

Glu Lys Met Thr Ala Leu Gly Phe Glu He He Lys Pro Asp Gly Ala 

35 40 45 

Phe Tyr He Phe Ala Lys He Pro Ala Gly Tyr Asn Gin Asp Ser Phe 

50 55 go 

Ala Phe Leu Lys Asp Phe Ala Gin Lys Lys Ala Val Ala Phe He Pro 
65 70 75 80 

Gly Ala Ala Phe Gly Arg Tyr Gly Glu Gly Tyr Val Arg Leu Ser Tyr 

85 90 95 

Ala Ala Ser Met Glu Thr He Lys Glu Ala Met Lys Arg Leu Glu Glu 

100 105 110 

Tyr Met Arg Glu Ala 

115 

(2) INFORMATION FOR SEQ ID NO: 441: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : s ing 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441: 
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Met Asn lie Thr Asn Leu Phe Ser lie Lys Thr Gly Cys Asp Glu Thr 

1 5 10 15 

Asp Arg Gin Leu Gin Lys Leu Phe Phe Gin Leu Asp Leu Gin Leu Gly 

20 25 30 

Glu Leu Thr Asp Gin Leu Arg Lys Leu Asp Ser Asn Phe Val Pro Arg 

35 40 45 

Ser Gin Phe Val Asp Thr Leu Asp Leu Asn Asp Val Glu Tyr Lys Glu 

50 55 60 

lie Leu Asn Tyr Phe lie Phe His Arg Asn Asp Ser Glu Glu Ser Leu 
65 70 75 80 

Val Glu Trp Leu Tyr Asp Trp lie Ser Thr Asn Arg Tyr Glu Leu Pro 
85 90 95 

(2) INFORMATION FOR SEQ ID NO:442: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442: 

Met Leu Gin Trp lie Asp Arg Thr Phe Gin Asp Tyr Leu Glu Leu Glu 

15 10 15 

His Leu Val Leu Thr Thr Trp Ser Gly Asn Leu Gly Met Met Lys Leu 

20 25 30 

Ala Glu Lys Leu Xaa Met Lys Lys Xaa Ala His lie Xaa Lys Val Arg 

35 40 45 

Tyr Tyr Gin Gly Lys Tyr Phe Asp Ser lie Lys Tyr Gly lie Leu Arg 

50 55 60 

Xaa Asp Trp Glu Lys lie Asn Asp Gly Tyr Tyr Gin lie Tyr Xaa Asn 
65 70 75 80 

Ser 



(2) INFORMATION FOR SEQ ID NO:443: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single. 
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(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443: 

Met lie Val Glu Glu Val Arg Asp Arg Tyr Val Gly Lys Val Asp Ala 

1 5 10 15 

Val Phe His Asn Gly Asp Ser Glu Leu Arg Pro Asp Ser Pro Leu Trp 

20 25 30 

Glu Gly He Arg Val Val Lys Gly Asn Met Asp Phe Tyr Ala Gly Tyr 

35 40 45 

Pro Glu Arg Leu Val Thr Glu Leu Gly Ser Thr Lys lie He Gin Thr 

50 55 60 

His Gly His Leu Phe Asp He Asn Phe Asn Phe Gin Lys Leu Asp Tyr 
65 7 ° 75 80 

Trp Ala Gin Glu Glu Glu Ala Ala He Cys Leu Tyr Gly His Leu His 

85 90 95 

Val Pro Ser Ala Trp Leu Glu Gly Lys He Leu Phe Leu Asn Pro Gly 

100 105 no 

Ser He Ser Gin Pro Arg Gly Thr. He Arg Glu Cys Leu Tyr Ala Arg 

115 120 125 

Val Glu He Asp Asp Ser Tyr Phe Lys Val Asp Phe Leu Thr Arg Asp 

130 135 140 

His Glu Val Tyr Pro Gly Cys Pro Arg Ser Leu Ala Asp Asp Ala Lys 
145 150 155 160 

Glu Leu Arg Leu Ser Val Gly Ala Glu Asn Phe 
165 170 

(2) INFORMATION FOR SEQ ID NO: 444: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444: 

Met Asn Leu His Gin Pro Leu His Val Leu Pro Gly Val Gly Pro Lys 
1 5 10 15 
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Ser Ala Glu Lys Tyr Ala Lys Leu 
20 

Leu Leu Tyr Phe Pro Phe Arg Tyr 

35 40 
Leu Glu Leu Glu Asp Gly Glu Lys 

50 55 
Thr Pro Ala Ser Val Gin Tyr Tyr 
65 70 
Ala Phe Leu Val Leu Gin Ala Arg 
85 

Gly 
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Gly He Glu Asn Leu Gin Asp Leu 
25 30 
Glu Asp Phe Lys Thr Lys Gin Val 
45 

Ala Val Leu Ser Gly Gin Val Val 
60 

Gly Phe Gin Ala Ala Asn Pro Pro 

75 80 
Glu Glu Gly Ser Phe Phe Phe Gly 
90 95 



(2) INFORMATION FOR SEQ ID NO: 445: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445: 

Met Gin Leu Leu Ser Glu Asn Met Leu Lys Thr lie Gin Ser Leu Ser 

1 5 10 15 

Val Trp Gin lie Tyr Leu Leu Gly Phe Glu Arg lie Leu Ala Leu Gly 

20 25 30 

Phe Gin Leu Leu Leu Thr Val -Trp Val Tyr Gin Ala Val Arg Gin Lys 

35 40 45 

Lys Trp He Tyr Leu Leu Ala Ala Tyr Gly Leu His Ala Phe Phe Asp 

50 55 60 

Leu Ala Pro Ser Leu Phe Gin Val Gly Trp Leu Thr Asn Pro Val Leu 
65 70 75 80 

Val Glu Val He Leu Ala Leu Glu Leu Val Leu Val Ala Tyr Gly Thr 

- 85 90 95 

Lys Glu He Phe Cys Lys Lys Ser 
100 

(2) INFORMATION FOR SEQ ID NO: 44 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 122 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44 6: 

Met Thr Lys Ser Met Thr Pro Asp Arg Glu Val He Thr Phe He Pro 

1.5 10 15 

Glu Lys Phe He Val Asp Gly Phe Gin Gly He Arg Asp Pro Arg Gly 

20 25 30 

Met Met Gly Val Arg Leu Glu Met Arg Gly Leu Leu Tyr Thr Gly Pro 

35 40 45 

Arg Thr He Leu His Asn Leu Arg Lys Thr Val Glu Arg Ala Gly Val 

50 55 60 

Gin Val Glu Asn Val He He Ser Pro Leu Ala Met Val Gin Ser Val 
65 7 0 75 QO 

Leu Asn Glu Gly Glu Arg Glu Phe Gly Ala Thr Val He Asp Met Gly 

85 90 95 

Ala Gly Gin Thr Thr Val Ala Thr He Arg Asn Gin Glu Leu Gin Phe 

100 105 i 10 

Thr His He Leu Gin Glu Val Glu He Met 
115 120 

(2) INFORMATION FOR SEQ ID NO: 447: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 447: 

Met Leu Asn Ser He Val Thr He He Cys He Ala Leu He Ala Phe 

1 5 10 15 

He Leu Phe Trp Phe Phe Lys Lys Pro Glu Lys Ser Gly Gin Lys Ala 

20 25 30 

Gin Gin Lys Asn Gly Tyr Gin Glu He Arg Val Glu Val Met Gly Gly 
3 5 40 45 
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Tyr Thr Pro Glu Leu He Val Leu Lys Lys Ser Val Pro Ala Arg He 

50 55 60 

Val Phe Asp Arg Lys Asp Pro Ser Pro Cys Leu Asp Gin He Val Phe 



Val Glu. He Thr Pro Glu Gin Ala Gly Glu Phe Gly Phe Ala Cys Gly 

100 105 no 

Met Asn Met Met His Gly Lys Met lie Val Glu 



(2) INFORMATION FOR SEQ ID NO: 448: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448: 

Met Lys He Val Ser Gly He Tyr Gly Gly Arg Pro Leu Lys Thr Leu 

1 . . 5 10 15. 

Glu Gly Lys Thr Thr Arg Pro Thr Ser Asp Lys Val Arg Gly Ala He 

20 25 30 

Phe Asn Met lie Gly Pro Tyr Phe Glu Val Gly Xaa Val Leu Asp Leu 

35 40 45 

Tyr Ala Gly Ser Gly Gly Leu Ser He Glu Ala Val Ser Arg Gly Met 

50 55 60 

Ser Ser Ala Val Leu Val Glu Arg Asp Arg Lys Ala Gin Thr He Val 
65 70 75 80 

Ala Glu Asn He Gin Met Thr Lys Glu Val Gly Lys Phe Gin Leu Leu 

85 90 95 

Gin Asn Gly Cys Arg Lys Gly He Gly Thr Gly Xaa Leu Gly Asn Leu 

100 105 no 

Thr Ser Phe Ser 
115 

(2) INFORMATION FOR SEQ ID NO: 449: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 52 amino acids 



65 




Pro Asp 



Phe Gly Val His Ala Asn Leu Pro Met Gly Glu Glu Tyr Val 
85 90 95 



115 



120 
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(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449 : 

Met Xaa Thr Asn Phe Thr Asp Gin Val Asp Thr Met He Tyr Val Asp 

1 5 10 15 

Lys Glu Glu Lys Glu Thr He Lys Ala Ala Leu Val Glu Phe Phe Asn 

20 25 30 

Gly Lys Val Thr Leu Thr Asp Gin Gly Leu Arg Glu Val Glu Val Pro 

35 40 45 

Val Asn Leu Val 
50 

(2) INFORMATION FOR SEQ ID NO: 450: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) ' MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 50: 

Met He Gly Val Ala Leu Glu Glu Ser Pro Thr Leu Lys He His His 

1 5 . 10 15 

Leu Thr His Asp He Thr Pro Tyr Asn He Phe Glu Gly Ser Tyr Arg 

20 25 30 

Leu Phe Gin Thr Val Asp Tyr Trp Pro Glu Gly Thr Thr Phe Val Ser 

35 40 45 

Val Val Asp Pro Gly Val Gly Ser Lys Arg Lys Ser Val Val Ala Lys 

50 55 60 

Thr Ala Gin Lys Ser lie His Cys His Ala Arg 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 4 51: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 13 0 amino acids 
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{ B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451 : 



Met Glu Leu Arg Arg Pro Arg Leu Ala Asp Lys Lys Ala Val Leu Asp 

1.5 10 15 

Met Met Thr Glu Phe Glu Lys Ser Gin Ser Ala His Asp Gly Gly Phe 

20 25 30 

Trp Asp Thr Glu Asn Phe Val Tyr Glu Glu Trp Leu Glu Ser Asn Gin 

35 40 45 

Glu Gin Glu Met Gly lie Asn Leu Pro Glu Gly Trp Val Pro Ala lie 

50 55 60 

Gin Leu Val Ala Phe Ser Glu Lys Gly Gin Ala Val Gly Phe Leu Asn 
65 70 75 80 

Leu Arg Leu Arg Leu Ser Asn Phe Leu Leu Glu Glu Gly Gly His lie 

85 90 95 

Gly Tyr Ser lie Arg Pro Ser Glu Arg Gly Lys Gly Tyr Ala Lys Glu 

100 105 110 

Leu Ser Val Arg Ala Cys Lys Leu Leu Arg Lys Arg Thr Ser Arg Lys 
115 120 125 

Leu Trp 
130 

(2) INFORMATION FOR SEQ ID NO: 452: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452: 

Met Thr Phe lie Arg Gly Pro lie lie Ser Thr Val Gly Glu Gly Val 

1 5 10 15 

Xaa lie Leu Ala Thr Val Asn Xaa Gin Phe Val Xaa Xaa Gin Xaa Xaa 

20 25 30 
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Asn Met Leu Val Ser Pro Phe His Pro Glu Leu Thr Asp Xaa Xaa Arg 

35 .40 45 

Leu Xaa Gin Tyr Phe lie Ser Met Cys Lys Glu Lys Ser Xaa Asp 
50 55 eo 

(2) INFORMATION FOR SEQ ID NO: 453 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45 3: 
Met Lys Gly Asp Val He Thr Glu His Gin Phe Tyr Glu Gin Val Lys 



1 5 10 



15 



Asn Asn Pro Ser Ala Gin Xaa Val Leu Leu Asn Met Thr He Gin Lys 

20 25 30 

Val Phe Glu Lys Thr He Trp Leu Arg Ala 
35 40 

(2) INFORMATION FOR SEQ ID NO: 454: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454: 

Met Asp Lys Ala Leu Ala Asp Leu Lys Thr Ser Gly His Leu Pro He 

Pro Arg His Leu Arg Asp Gly His Tyr Xaa Gly Ser Lys Glu Leu Gly 

20 25 30 

Asn Ala Gin Asp Tyr Leu Tyr Pro His Asn Tyr Pro Xaa Asn Trp Val 

35 4 ° 45 

Lys Gin Asp Tyr Leu Pro Gin Lys lie Arg Asn His His Tyr Phe Gin 
50 " 60 
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Ala Glu Tyr Thr Gly Lys Tyr GLu Arg Ala Leu Ala Gin Arg Lys Glu 
65 70 75 80 

Ala lie Asp His Leu Arg Lys lie 
85 



<2) INFORMATION FOR SEQ ID NO: 455: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 55 : 

Met Val Val Phe Thr Gly Ser Thr Val Glu Glu Ala lie Gin Lys Gly 

15 10 15 

Leu Lys Glu Leu Asp lie Pro Arg Met Lys Ala His lie Lys Val lie 

20 25 30 

Ser Arg Glu Lys Lys Gly Phe Leu Gly Leu Phe Gly Lys Lys Pro Ala 

35 40 45 

Gin Val Asp lie Glu Ala He Ser Glu Thr Thr Val Val Lys Ala Asn 

50 ,55 60 

Gin Gin Val Val Lys Gly Val Pro Lys Lys He Asn Asp Leu Asn Glu 
65 70 75 80 

Pro Val Lys Thr Val Ser Val Arg Asn Arg 
85 90 



(2) INFORMATION FOR SEQ ID NO: 45 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456: 



Met Thr Lys Arg Cys Ser Trp Val Lys Met Thr Asn Pro Leu Tyr He 
15 10 15 
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Ala Tyr His Asp Glu Glu Trp Gly Gin Pro Leu His Asp Asp Gin Ala 

20 25 30 

Leu Phe Glu Leu Leu Cys Met Glu Thr Tyr Gin Ala Gly Leu Ser Trp 

35 40 45 

Glu Thr Val Leu Asn Lys Arg Gin Ala Phe Arg Glu Ala Phe Pro 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 457: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457: 

Men Phe Val Gly Gin Glu Trp Thr Asn Gin Thr Phe Val Asp Leu Leu 

1 5 io 15 

Gly Asn His Gin Gly Gin Val Thr He Asp Glu Glu Gly Tyr Gly Gin 

2 ° 25 30 

Phe Pro Val Ser Ala Arg Ser Val Ser Val Trp Ala Val Asn Thr He 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 458: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 58: 

Met Ala Gin Val Asp He He His Gly He Gly Thr Gly Val He Arg 

1 • 5 10 is 

Glu Gly Val Thr Lys Tyr Leu Gin Arg Asn Lys His Val Lys Ser Phe 

20 25 30 

Gly Tyr Ala Pro Gin Asn Ala Gly Gly Ser Gly Ala Thr He Val Thr 
35 40 45 
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50 
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(2) INFORMATION FOR SEQ ID NO: 459: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459: 



Met Asn Leu Leu Glu Glu Ala Thr Lys Gly Lys Val lie Phe Glu Gly 

15 10 15 

Val Asp lie Thr Asp Lys Lys Asn Asp Leu Phe Pro Met Arg Glu Lys 

20 25 30 

Met Gly Met Val Phe Gin Gin Phe Asn Xaa Phe Leu lie 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 460: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 9 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 60: 

Met Gin lie Asp Leu Ala Asn Leu Leu Pro Asp Leu Pro Gly Asn Leu 

15 10 15 

Ser Gly lie Ash Pro Asn Arg Tyr Val Phe Tyr Gin Asp Val Leu Cys 

20 25 30 

Pro lie Leu Asp Arg His Met Thr Pro Glu Gin Asp Lys Pro His Phe 

35 40 45 

Ala Gin Ala Ala Gly Thr lie Ala Asp lie Lys Glu Lys Ala Gly Asn 

50 55 w 60 

Tyr Ala Tyr Leu Phe Glu Thr Gin Ala Gin Leu Asn Ala lie Leu Ser 
65 70 75 80 
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Ser Lys Val Asp Val Gly Arg Arg He Arg His Ala Tyr Gin Ala Asp 

85 90 95 

Asp Lys Glu Ser Leu Gin Gin He Ala Arg Gin Glu Leu Pro Glu Leu 

100 105 il0 

Arg Ser Gin He Glu Asp Phe His Ala Leu Phe Ser His His Trp Leu 
115 120 125 

Lys 



(2) INFORMATION FOR SEQ ID NO: 4 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461: 



Met Arg Gin Gly He Asp Ser Leu Ala Tyr Val Val Lys Thr His Phe 

Glu Leu Asp Pro Phe Ser Gly Gin Ala Phe Leu Phe Cys Gly Gly Arg 

20 25 30 

Lys Asp Arg Phe Lys Ala Leu Tyr Trp Asp Gly Gin Gly Phe Trp Leu 

35 40 45 

Leu Tyr Lys Arg Phe Glu Asn Gly Lys Leu Thr 
50 55 



(2) INFORMATION FOR SEQ ID NO: 462: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462: 

Met Asn Ala Phe Leu Glu Glu Leu Ser Gin Ala Tyr Pro Asp Asp Tyr 
1 5 10 15 
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Phe Leu Leu Val Met Asp Asn Ala lie Trp His Lys Ser Ser Val Leu 

20 25 30 

Lys lie Pro Thr Asn lie Gly Phe Ala Phe lie Pro Pro Tyr Thr Pro 

35 40 45 

Glu Met Asn Pro He Glu Gin Val Trp Lys Glu He Arg Lys Arg Gly 

50 55 60 

Phe Lys Asn Lys Ala Phe Arg He Leu Glu Asp Val Met Asn Gin Leu 
65 70 75 80 

Gin Asp Val He Gin Gly Leu Glu Lys Glu Val lie Lys Ser He Val 

85 90 95 

Asn Arg Arg Trp Thr Arg Met Leu Phe Glu Ser Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 463: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463: 

Met Thr Val Glu Glu Glu Lys Val Phe Leu Ala Arg His Leu Lys Ala 

15 10 15 

Thr Glu Ala Gly Glu Phe Val Thr He Asp Ala Leu Phe Gin Ala Tyx 

20 25 30 

Lys Lys Glu Leu Gly Arg Ser Tyr Thr Arg Asp Ala Phe Tyr Gin Leu 

35 40 45 

Leu Lys Arg His Gly Trp Arg Asn He Met Pro Arg Pro Glu His Pro 

50 55 60 

Lys Lys Ala Asp Ala Gin Thr He Val Ala Ser Lys Asn Lys He Ser 
65 70 75 80 

He Gin Glu Asp Lys Lys Ala He 
85 

(2) INFORMATION FOR SEQ ID NO: 464: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
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<D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464: 

Met Leu Ser Leu Val Val Leu He Thr Gin Tyr Ala Thr His Gly Glu 

1 5 10 15 

Leu Phe Glu Asn Tyr lie Ser Lys Glu Asn Asp Asn Val He Lys Glu 

. 20 25 30 

Tyr Gin Asn Thr Thr Gly Phe Ser Thr Pro Tyr Thr Leu Asp Gly Ser 

35 40 45 

Val Leu He Val His Pro Asp Leu Thr Lys Gly Met Asn He Glu Gly 

50 55 go 

Tyr Ser Asp Leu Leu Lys Pro Glu Leu Lys Gly Lys He Ala Thr Ala 
65 70 75 80 

Asp Pro Ala Asn Ser Ser Ser Ala Phe Ala Gin Leu Thr Asn Met Leu 

85 90 g5 

Gin Ala Gin Gly Gly Tyr Lys Asp Asp Leu Ala Trp Ser Val Leu Ser 

100 105 no 

Thr He Phe Ser Thr Leu He Asp Gly He Val Lys 
115 120 

(2) INFORMATION FOR SEQ ID NO : 4 6 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 8 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465 : 



Met Leu Ala Arg Lys Asp Arg Phe 

1 5 ; 
Lys Ala Leu Gly He Phe Leu Ser 
20 

He Glu Val Gly Ala Val Thr Phe 
35 40 



Tyr Tyr lie Asp Lys Ala Thr Gly 

10 15 
Cys Val Leu Thr Ser Asn Gly Val 
25 30 
Ser Pro Lys Leu Arg Gly Thr Arg 
45 
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lie Gly Thr Glu Ala Gin Tyr Leu Leu Ala Arg Tyr Val Phe Glu Glu 

50 55 60 

Leu Asn Tyr Arg Arg Tyr Glu Trp Lys Cys Asp Ala Leu Asn Leu Pro 
65 70 75 80 

Ser Arg Arg Ala Ala Glu Arg Leu Gly Phe He Tyr Glu Gly Thr Phe 

85 90 95 

Arg Gin Ala Val Val Tyr Lys Gly Arg Thr Arg Asp Met Asp Trp Leu 

100 105 no 

Ser Met He Asp Lys Asp Trp Pro Lys Val Lys Asp Arg Leu Glu Thr 

115 120 125 

Trp Leu Arg Pro Glu Asn Phe Asp Lys Asn Gly Gin Gin Tyr Lys Ser 

130 135 140 

Leu Arg Glu Leu 
145 



(2) INFORMATION FOR SEQ ID NO: 466: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466: 



Met Met Thr Ser Asp Phe Phe Glu Val Trp Phe Gin Lys Phe Leu Leu 

1 5 10 15 

Pro Thr Leu Thr Thr Pro Ser Val lie He Val Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 467: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467: 
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Met Val Ser Thr Lys Thr Gin He Ala Gly Phe Glu Phe Asp Asn Cys 

1 5 10 is 

Leu Met Asn Ala Ala Gly Val Ala Cys Met Thr lie Glu Glu Leu Glu 

20 25 30 

Glu Val Lys Asn Ser Ala Ala Gly Thr Phe Val Thr Lys Thr Ala Thr 

35 40 45 

Leu Asp Phe Arg Gin Gly Asn Pro Glu Pro Arg Tyr Gin Asp Val Pro 

50 55 60 

Leu Gly Ser He Asn Ser Met Gly Leu Pro Asn Asn Gly Leu Asp Tyr 
65 70 75 80 

Tyr Leu Asp Tyr Leu Leu Asp Leu Gin Glu Lys Glu Ser Asn Arg Thr 

85 90 95 

Phe Phe Leu Ser Leu Val Gly Met Ser Pro Glu Lys Pro He Leu Phe 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 46 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 35 amino acids 
(B> TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear . 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 

Met Thr Tyr Leu Met Phe Ser Gly Leu Asp Glu Asn Phe Tyr His Phe 

15 10 15 

Pro Trp Glu Val Phe Ala Gly Phe Gly He He Phe Leu Ala Cys Pro 
20 25 30 

Glu Lys Val 
35 

(2) INFORMATION FOR" SEQ ID NO: 469: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 ainino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469: 
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Met Phe Thr Gly Trp Phe Ala Gin 

1 5 
Gly Ala Trp Gly Leu Gly Phe Met 
20 

Lys Glu Leu Leu Glu Gly Glu Tyr 
35 40 



Thr Phe Leu His Gin Phe lie Arg 

10 15 
lie Phe lie Ala Phe Pro Met Gly 
25 30 
His Glu His Asp 



(2) INFORMATION FOR SEQ ID NO:470: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 0: 



Met Gin Glu Lys lie Leu Val Thr Gly Gly Ala Gly Phe lie Gly Thr 

15 10 15 

His Thr Val lie Glu Leu lie Gin Ala Gly His Gin Val Val Val Val 

20 25 30 

Asp Asn Leu Val Asn Ser Asn Xaa Lys Ser Leu Glu Val Val Glu Arg 

35 40 45 

lie Thr Gly Val Glu lie Pro Phe Tyr Glu Ala Asp lie Xaa Asp Thr 

50 55 60 

Asp Thr Leu Arg Asp lie Phe Lys Gin Glu Glu Leu Thr Gly Val lie 
65 70 75 80 

His Phe Ala Gly Leu Lys Ala Val Gly Glu Ser Thr Arg lie Pro Leu 

85 90 95 

Ala Tyr Tyr Asp Asn Asn lie Ala Gly Thr Val Ser Leu Leu Lys Ala 

100 105 110 

Met Glu Glu Asn Asn Cys Lys Asn lie lie Phe Ser Ser Xaa Ala Thr 

115 120 125 

Val Tyr Gly Asp Pro Tyr Thr Val Pro lie Leu Glu Val Ser His Phe 
130 135 140 

Gin 
145 

(2) INFORMATION FOR SEQ ID NO: 471: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471: 

Met Val Lys Glu Lys Val Thr Leu Thr Pro Leu Ala Ser Gly Tyr Gin 

lie Gly Glu Glu Glu Phe Glu Gin Val lie Leu Ala Xaa Gly Ala Trp 

20 25 30 

Leu Gly Asp Met Leu Glu Pro Leu Gly Tyr Glu Val Asp Val Arg Pro 

35 40 45 

Gin Lys Gly Gin Leu Arg Asp Tyr Gin Leu Val Gin Asp Met Glu Ala 

50 55 60 

Tyr Pro Val Val Met Pro Glu Gly Glu Trp Asp Leu lie Pro Phe Ala 
65 70 75 80 

Gly Gly Lys Leu Ser Leu Gly Ala Thr His Glu Asn Asp Met Gly Phe 

85 90 95 

Asp Leu Thr Val Asp Glu Thr Leu Leu Gin Gin Met Glu Glu Ala Thr 

100 105 110 

Leu Thr His Tyr Leu lie Leu Ala Glu Ala Thr Ser Lys Ser Glu Arg 

H5 120 125 

Val Gly lie Arg Ala Tyr Thr Ser Asp Phe Ser Pro Phe Phe Gly Gin 

I 30 135 140 

Val Pro Asp Leu Thr Gly Val Tyr Ala Ala Ser Gly Leu Gly Ser Ser 
145 I 50 155 ' i60 

Gly Leu Thr Thr Gly Pro lie He Gly Tyr His Leu Ala Gin Leu lie 

l g 5 170 175 

Gin Asp Lys Glu Leu Thr Leu Asp Pro Glu Asn Tyr Pro He Glu Asn 

180 185 190 

Tyr Val Lys Arg Val Lys Ser Glu 
195 200 

(2) INFORMATION FOR SEQ ID NO: 472: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472: 

Met Glu His lie lie Tyr Gin Phe Glu Glu Asp Leu Ala lie Leu Thr 

15 10 is 

Leu Asn Arg Pro Glu Val Ala Asn Gly Phe His lie Pro Met Cys Glu 

20 25 30 

Glu lie Leu Glu Ala Leu Thr Leu Ala Glu Glu Asn Pro Ala Val His 

35 40 45 

Phe He Leu He Asn Ala Asn Gly Lys Val Phe Ser Val Gly Gly Asp 

50 55 60 

Leu Val Glu Met Lys Arg Ala Val Asp Glu Asp Asp He Pro Ser Leu 
65 70 75 80 

Thr Lys He Ala Glu Leu Val Asn Thr He Ser Tyr Lys lie Lys Gin 

85 90 95 

He Ala Lys Pro Val Phe Asn Gly Ser 
100 105 

(2) INFORMATION FOR SEQ ID NO: 473: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:473: 

Met Gin Pro Glu Val He Trp Leu Gin Lys He Ala Lys Pro Ser Gin 

15 10 15 

Met Lys Val Leu Asn Thr Thr Asp Arg Val Gin Ala He Lys Asp Asp 

20 25 30 

Val Asp He He Gin Asn Ser Leu Gin He lie Asn Gin Gin Lys Glu 

35 40 45 

Leu He Lys Glu Tyr Gin Glu Asp Leu Thr Tyr Lys Phe Lys Val Leu 

50 55 60 

Glu Lys Asp He Gin Thr Arg Thr Ser Cys Asp Lys Arg Asn Ala Gly 
65 70 75 80 

Asn 
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(2) INFORMATION FOR SEQ ID NO: 474: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474: 



Met Gly Ser Asn Ser Leu Thr Leu Leu Leu Lys lie Gin Ala Thr lie 

1 5 10 15 

Pro Arg Asn He Tyr Asp Asn Leu Gin Ala Gly Ser Lys Val Thr Leu 

20 25 30 

Asp Arg Ala Tyr Gly His Met He He Glu Glu Gly Arg Glu Asn Gin 

35 40 45 

Val Trp He Ala Gly Gly He Gly He Thr Pro Phe He Ser Tyr He 

50 55 60 

Arg Glu His Pro He Leu Asp Lys Gin Val His Phe Tyr Tyr Ser Phe 
65 7 0 75 * ~ so 

Arg Gly Asp Glu Asn Ala Val Tyr Leu Asp Leu Leu Arg Asn Tyr Ala 

85 90 95 

Gin Lys Ash Pro Asn Phe Glu Leu His Leu He Asp Ser Thr Lys Asp 

100 105 no 

Gly Tyr Leu Asn Phe Glu Gin Lys Lys Cys Pro Asn Met Gin Pro Ser 

115 120 125 

He Cys Val Val Leu Phe Leu 
130 135 

(2) INFORMATION FOR SEQ ID NO: 475: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 5: 
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Met Phe Phe Val Leu 

1 5 
Lys Leu His Pro He 
20 

Lys Asp Val Asp Phe 

35 

Asp Gin Asp Phe Leu 
50 

Gin Ala Ser Lys Val 
65 

Gin lie Leu Ser Asn 
85 

Glu Met Ala Val Arg 
100 

Glu Glu Leu Val Glu 
115 

Val Arg Arg Leu Leu 
130 

Leu Pro Glu Ala lie 
145 

His Leu Lys Ser Leu 
165 

Lys Glu Pro Trp Asp 
180 

Leu Gly Lys Pro Ser 
195 

Arg lie Glu Thr Asn 
210 



Ala Tyr Ala Lys Ala Val 
10 

Gin Arg Gin Gly Ala Gly 
25 

Ala Ser Ala Thr Ala Leu 
40 

Glu Arg Phe Met Pro Ser 
55 

He Trp Glu Asp Tyr Phe 
70 75 
Pro Asp Leu Thr Thr He 
90 

lie Lys Glu Ala He Lys 
105 

Leu Val Thr Thr Lys Arg 
120 

Thr Tyr He Leu Met Gin 
135 

His Val Leu Gly Phe Thr 
150 155 
Lys Gly Gin Val Asn Leu 
170 

Ala Met Thr Gin Lys Val 
185 

He Ala Glu Gin Asn Phe 
200 
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Ala Gly Arg Asn He 
15 

Tyr His Ser Val Asn 
30 

Arg Gin His Gin Lys 
45 

Val Ala Leu Phe Glu 
60 

Pro Leu Leu Arg Tyr 
80 

Tyr Gin Val Asn Gin 
95 

Thr Ala Gin Ser Val 
110 

Tyr Thr Lys Ala Arg 
125 

Ala Arg Glu Ser Asp 
140 

Glu Lys Gly Arg Gin 
160 

Val Ser Arg He Gly 
175 

Asp Gin He Tyr Gin 
190 

Gly Arg Val Pro He 
205 



(2) INFORMATION FOR SEQ ID NO: 476: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 6: 

Met Leu Gly Ser Met Phe Val Gly Leu Leu Val Gly Phe Leu Ala Gly 
15 10 15 
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Ala Met Thr Asn Arg Gly Glu Arg 
20 

Leu Gly Trp lie Gly Ala Phe Leu 

35 40 
Gly Pro Val Leu Ser Gly Thr Ala 

50 55 
Met lie Val Leu Ala lie Phe Trp 
65 70 
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Met Gly Cys Phe Gly Lys Met Phe 
25 30 
Gly His Leu Leu Phe Gly Thr Trp 
45 

He He Pro Ala Val Leu Gly Ala 
60 

Arg Arg Gly Ser 
75 



(2) INFORMATION FOR SEQ ID NO: 477: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477: 

Met Arg Ser Ser Pro Ser Arg Glu He Gin Pro Arg Ser Trp Gly He 

15 10 15 

Thr Thr Val Tyr Val Thr His Asp Gin Glu Gly Ala Met Ala He Ser 

20 25 30 

Asp Gin He Ala Cys Tyr Glu Arg Trp Gly Asp Pro Thr Asn Arg Pro 

35 40 45 

Thr Lys Arg Thr Val Ser 
50 
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WHAT IS CLAIMED IS: 
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1 . An isolated polynucleotide comprising a polynucleotide sequence selected from 
the group consisting of: 

5 (a) a polynucleotide having at least a 70% identity to a polynucleotide encoding a 

polypeptide comprising an amino acid sequence of selected from the group consisting of SEQ ED 
NO:224 through 477, 

(b) a polynucleotide which is complementary to the polynucleotide of (a); 

(c) a polynucleotide having at least a 70% identity to a polynucleotide encoding the 
10 same mature polypeptide expressed by the a gene comprising a polynucleotide sequence selected 

from the group consisting of SEQ ID NO: 1 through 223 contained in the DNA of the deposited 
clone; and 

(d) a polynucleotide comprising at least 15 sequential bases of the polynucleotide 
of (a), (b)or(c). 

15 2. The polynucleotide of Claim 1 wherein the polynucleotide is DNA. 

3. The polynucleotide of Claim 1 wherein the polynucleotide is RNA. 

4. The polynucleotide of Claim 2 comprising the nucleic acid sequence set forth in 
SEQ ED NO: 1. 

5. The polynucleotide of Claim 2 comprising a nucleotide sequence selected from 
20 the group consisting of SEQ ID NO : 1 through 223 . 

6. The polynucleotide of Claim 2 which encodes a polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO:224 through 477. 

7. A vector comprising the polynucleotide of Claim 1 . 

8. A host cell comprising the vector of Claim 7. 

25 9. A process for producing a polypeptide comprising: expressing from the host 

cell of Claim 8 a polypeptide encoded by said DNA. 

10. A process for producing a noval polypeptide or fragment comprising 
culturing a host of claim 8 under conditions sufficient for the production of said polypeptide 
or fragment. 

30 11. A polypeptide comprising an amino acid sequence which is at least 70% 

identical to an amino acid sequence selected from the group consisting of SEQ ID NO:224 
through 477. 

12. A polypeptide comprising an amino acid selected from the group consisting of 
SEQ ID NO:224 through 477. 
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13. An antibody against the polypeptide of claim 1 1. 

1 4. An antagonist which inhibits the activity or expression of the polypeptide of 
claim 11. 

15. A method for the treatment of an individual in need of a Streptococcal 

5 polypeptide comprising: administering to the individual a therapeutically effective amount of the 
polypeptide of claim 1 1 . 

16. A method for the treatment of an individual having need to inhibit a bacterial 
polypeptide comprising: administering to the individual a therapeutically effective amount of the 
antagonist of Claim 14. 

10 17 A process for diagnosing a disease related to expression or activity of the 

polypeptide of claim 1 1 in an individual comprising: 

(a) determining a nucleic acid sequence encoding said polypeptide, and/or 

(b) analyzing for the presence or amount of said polypeptide in a sample derived from 
the individual. 

15 1 8 - A method for identifying compounds which interact with and inhibit or activate 

an activity of the polypeptide of claim 1 1 comprising: 

contacting a composition comprising the polypeptide with the compound to be screened 
under conditions to permit interaction between the compound and the polypeptide to assess the 
interaction of a compound, such interaction being associated with a second component capable of 
providing a detectable signal in response to the interaction of the polypeptide with the 
compound: 

and determining whether the compound interacts with and activates or inhibits an 
activity of the polypetide by detecting the presence or absence of a signal generated from the 
interaction of the compound with the polypeptide. 

19. A method for inducing an immunological response in a mammal which 
comprises inoculating the mammal with Streptococcal polypeptide of claim 1 1, or a fragment 
or variant thereof, adequate to produce antibody and/or T cell immune response to protect 
said animal from disease. 

20. A method of inducing immunological response in a mammal which comprises 
delivering a nucleic acid vector to direct expression of Streptococcal polypeptide of claim 1 1 
or fragment or a variant thereof, for expressing said Streptococcal polypeptide, or a fragment 
or a variant thereof in vivo in order to induce an immunological response to produce 
antibody and/ or T cell immune response to protect said animal from disease. 



20 
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21 . An isolated Streptococcal polypeptide having one of the amino acid 
sequences of SEQ ID NO:224 through SEQ ID NO:234. 

22. An isolated nucleic acid encoding one of the amino acid sequences of 
Claim 21 and nucleic acid sequences capable of hybridizing therewith under stringent 

5 conditions. 

23. Recombinant vectors comprising the nucleic acid sequences of 
Claim 22 and host cells transformed or transfected therewith. 

24. A method of identifying an antimicrobial compound comprising contacting 
candidate compounds with a polypeptide of Claim 21 and selecting those compounds 

10 capable of inhibiting the bioactivity of said polypeptide. 

25. Antimicrobial compounds identified by the method of Claim 24. 



348 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/05306 



I A. CLASSIFICATION OF SUBJECT MATTER 
IPC (6) :Plcaac See Extra Sheet. 

USCL :53cV23.7; 433/6, 69.1,252.3, 320.1 ; 530/324, 350; 514/12 
i According to International Patent C l a ssi fi ca tion (IPC) or to both national clarification and IPC 



FIELDS SEARCHED 



Minimum documentation searched (clarification system followed by classification symbols) 
U.S. : 536/23.7; 435/6, 69.1. 252.3 , 320.1; 530/324. 350; 514/12 

I Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 

| Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
Please See Extra Sheet. 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 1 



X, P 

|Y, P 



Citation of document, with indication, where appropriate, of the relevant passages 



WO 96/10647 A1 (FUSO PHARMACEUTICAL INDUSTRIES, 
LTD.) 1 1 April 1996, claim 2, pages 22-24. 



WATSON, J. D. et al. Recombinant DNA, Second Edition. 
New York: Scientific American Books, W.H. Freeman and 
Company, 1992, pages 73 and 74, see entire document. 

KENNELL, D. E. Principles and Practices of Nucleic Acid 
Hybridization. Progress in Nucleic Acid Research Molecular 
Biology. 1 971 , Vol. 1 1 , pages 259-301 , see especially paces 
259-262. 



1 ] Further documents are listed in the 



continuation of Box C. 



1 1 See patent family i 



Relevant to claim No. 



1, 2, 7, 8, and 
22 



3, 6, 9-12, 15, 
17, 21, and 23- 
25 



1-3, 6-12, 15, 
17, and 21-23 



1-3, 6-12, 15, 
17, and 21-23 



•A* 



•E" 



i» too/ 



fee foaml awm of aw art wtica a mx 



Ifiatef **> 

on priority cfcmaU) or whicfa i 
dam of i 



later tfooaawos pubtkfaorf t/ter sae bmatkmi ftlaic dtaa or ■riorftv 
pr*Mnpic or Swory ^dcrtymt tfcc imMM 



to aa oral 



filac date but fccer ibam 



'oV 



j Date of the actual completion of the international search 
10 JULY 1997 




| Name and mailing address of the ISA/US 
Com mi— i oocr of Pataota aad Trsdcmarkj 
Box PCT 

Was hin g ton, D.C 20231 



I Facsimile No. (703) 305-3230 ' 

Form PCT/1SA/2 10 (second sheet XJuly 1992)* 



Date of mailing of the international search report 

0 6 AUG 1997 



Authorized officer 

. JAMES MARTIN ELL 
Telephone No. (703) 308-0196 



H96 L_Vj 



BNSDOCID: <WO 9737026A1 _!_> 



INTERNATIONAL SEARCH REPORT 



International Application No. 
PCT/US97/05306 



I OfervmtkMM where 



claims were found unsearchable (Cootinustioa of Men 1 of first sheet) 



Thk k+cmAUooAl report haa not been established in respect of certain claims under Article 17(2X») for the following i 



□ 



Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 



|~J Claims Nos 



because they relate to porta of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 



3. Q Claims Noa.: 

they are 



dependent claims and are not drafted in accordance with the second and third 



of Rule 6.4(a). 



II Observations where unity of invention is Lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 
See Extra Sheet. 



j~j As all required additional search fees were timely paid by the applicant, this international search report covers all searchable 



2. j™] As all searchable claim* could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 

3. [~j As only some of the required additional search fees were timely paid by the applicant, this international search report covers 

only those claims for which foes were paid, specifically claims Nos.: 



Fx] No required additional search fees were timely paid by the applicant. Consequently, this international search report 
^""^ restricted to the invention Grst mentioned in the claims; it is covered by claims Nos.: 
1-3, 6-12, 15, 17, and 21-23 (SEQ ID NO* 224-233). 



Remark on Protest 



r""l The additional search fees were accompanied by the applicant's protest. 
I"""] No protest accompanied the payment of additional search fees. 



Form PCT/IS A/210 (continuation of first sheet* 1)M July 1992)* 



BNSDOC10: <W0 9737026A1 J_> 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCI7US97/05306 



BOX n. OBSERVATIONS WHERE UNITY OF INVENTION IS LACKING (Continued): 

This application contain! the following inventions or groups of inventions which are n«f so Linked as to form a single 
inventive concept under PCT Rule 13. 1. In order for all inventions to be searched, the appropriate additional search 
fees must be paid. 

Group I. Claims 1-12. 15, 17, and 21-23, drawn to nucleic acids and polypeptides of specific sequence, vectors 
containing nucleic acids, methods of producing polypeptides, and methods of treatment using polypeptides. 

Group II. Claims 13 and 19, drawn to antibodies against any of no fewer than 254 specific sequence polypeptides. 

Group ID. Claims 14 and 16, drawn to antagonists of undisclosed composition of the activity of no fewer than 254 
specific sequence polypeptides. 

Group IV. Claims 18 and 24, drawn to methods for screening for compounds that interact with no fewer than 254 
specific seque n ce polypeptides. 

Group V. Claim 20, drawn to s method of inducing an immunological response using a nucleic acid. 
Group VI. Claim 25. drawn to compounds of undisclosed composition. 

The inventions listed as Groups I- VI do not relate to a single inventive concept under PCT Rule 13.1 because, under 
PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons. The nucleic 
acids, proteins, and methods of Group I are materially different from the antibodies of Group II. the compounds of 
Group m, and the compounds of Group VI and are not required to produce either the antibodies of Group II, which 
antibodies may be made with antigens other than those embraced by Group I. Additionally, the methods of Group I are 
not needed to produce the compounds of either one of Groups III or VI. The compositions and methods of Group I are 
not needed for the methods of either one of Groups IV or V. The antibodies of Group II arc not needed for the 
meth ods of Groups IV or V and are materially different from the compounds of Groups III and VI. The antagonists of 
Group HI arc materially different from the compounds of Group VT because the antagonists and compounds have 
differing sites of action and/or binding. Neither are the compounds of either one of Groups III or VI needed to practice 
the methods of either one of Groups IV or V. The methods of Groups IV and V may be practiced independently of one 

This application contains claims directed to more than one species of the generic invention. These species are deemed 
to lack Unity of Invention because they are not so linked as to form a single inventive concept under PCT Rule 13.1. 
In order for more than one species to be searched, the appropriate additional search fees must be paid. The species are 
as follows. Each of the 234 SEQ ID NOs listed is a separate species because there is no relationship between the 
species. 

The species listed shove do not relate to a single inventive concept under PCT Rule 13.1 because, under PCT Rule 
13.2, the species lack the same or corresponding special technical features for the following reason*. There is no 
relationship among the species. 

For any Group applicant elects, a total of 10 (TEN) specified sequences will be searched and no more than 4 (FOUR) 
specified seq u ences will be searched for each additional foe paid. 



The claims are deemed to correspond to the species listed above in the following manner: SEQ ID NOs 224-233 
correspond to claims 1-3, 6-12, 15, 17, and 21-23. 



A. CLASSIFICATION OF SUBJECT MATTER: 
IPC (6): 

C12N 15/31, 15/11, 15/00; C12P 21/02; A61IC 38/16, 39/09; C071C 14/195. 14/315 

B. FIELDS SEARCHED 

Electronic data bases consulted (Name of data base and where practicable terms used): 

MPSRCH OF: A-GENESEQ26, A- ISSUED, HIV-AA8, PIR50, SUIWWPROT34, EMBL-NEWl 1 , GENBANK97 
GENBANK-NEW1 1 # U-EMBL48_97, HIV-NA8, N-GENESEQ26. N-ISSUED. EST-STS, EST-STS-TWO, EST^TS- 



Fonn PCT/ISA/210 (extra shect)(July 1992)* 



9NS0OCID: <WO 9737026A1 J_> 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/O5306 



THREE. SEQ ID NOt 224-233. 



Form PCT/1SA/210 (extra tho*XJuly 1992)* 



